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Abstract 


Recent  papers  have  brought  out  the  intimate  connection  that 
exists  between  the  problems  of  detection  and  estimation  in  many 
conmunication  problems.  In  many  cases,  it  becomes  necessary  to  simul- 
taneously detect  and  extract  the  signal  so  that  the  two  operations 
must  be  jointly  optimized  in  order  to  improve  receiver  performance. 
However,  the  receiver  structures  so  obtained  are  considerably  more 
complicated  than  conventional  receivers  and  it  is  not  clear  that  this 
added  complexity  results  in  a significant  improvement  in  receiver 
performance.  This  report  investigates  a specific  suboptimal  scheme 
for  estimating  a signal  in  the  presence  of  uncertainty  regarding 
its  reception,  namely,  the  probabilistic  estimation  scheme.  The 
scheme  uses  a probabilistic  judgement  to  determine  the  presence  of 
the  signal  in  the  observations.  In  order  to  obtain  explicit 
estimation  algorithms  and  to  facilitate  performance  evaluation, 
a specific  problem  of  signal  estimation  under  uncertainty,  namely 
the  tracking  of  a target  in  a multi-target  multi-sensor  environment 
is  considered.  Explicit  algorithms  for  this  problem  are  developed 
and  the  performance  of  the  receiver  is  evaluated  under  conditions 
involving  fading  media  or  quantized  observations. 
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1 . INTRODUCTION 


Problems  of  signal  estimation  and  detection  which  are  central  to 
communication  systems  have  received  considerable  attention  in  recent 
years.  While  for  the  most  part  the  solutions  to  these  problems  have  been 
obtained  assuming  that  they  are  essentially  independent  operations  [1], 
recent  papers  have  brought  out  the  intimate  connection  that  exists  between 
the  two  [2-9].  In  many  cases,  it  becomes  necessary  to  simultaneously 
detect  and  extract  the  signal  so  that  the  two  operations  must  be  coupled 
in  order  to  improve  receiver  performance.  Such  examples  occur  in  all 
circumstances  where  properties  of  the  signal  source  or  transmission  medium, 
signal  presence  or  absence,  or  signal  identification  are  jointly  desired 
and  may  arise  in  such  diverse  fields  as  underwater  communications,  seismic 
detection,  pattern  recognition  or  radar  ranging  and  tracking.  For  example, 
in  communication  over  fading  dispersive  channels,  the  exact  value  of  chan- 
nel attenuation  is  not  known.  In  a multitarget,  raultisensor  environment » 
it  becomes  necessary  to  simultaneously  identify  and  track  the  target.  In 
a pattern  classification  problem,  the  particular  pattern  giving  rise  to 
the  received  signal  is  unknown. 

The  basic  problem  may  be  formulated  as  follows.  The  data  z received 
during  each  observation  interval  I consists  of  (a)  a signal  y corrupted  by 
observation  noise  v (Hypothesis  H^)  or  (b)  noise  v alone  (Hypothesis  H^) . 

We  assume  that  z and  y are  vectors  consisting  of  sampled  values.  The  sig- 
nal y is  assumed  to  be  a realization  of  a stochastic  process.  We  are  re- 
quired to  recursively  estimate  either  the  signal  y or  some  parameters  0 
associated  with  it. 
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We  thus  write 


Hi  1 \ “ yk  + \ 
H0  = zk  = vk 


for  k e I 


The  receiver  is  required  to  make  two  functionally  related  decisions  at 
the  end  of  each  observation  interval,  the  first  on  the  presence  or  ab- 
sence of  the  signal  (a  detection  decision)  and  the  second  on  the  va,  e 
of  the  signal  (or  parameter)  vector  (an  estimation  decision). 

Let  us  define  R^  as  the  risk  associated  with  detecting  the  signal 
presence  and  as  the  risk  associated  with  estimating  the  signal.  If 
the  two  operations  of  detection  and  estimation  can  be  carried  out  in 
parallel  or  simultaneously,  but  independently  and  the  two  components  R^ 
and  Rg  can  be  minimized  separately,  then  the  two  problems  are  said  to  be 
uncoupled  [4,5].  The  detection  procedure  is  carried  out  in  the  usual 
fashion  but  the  estimator  performance  has  to  be  optimized  in  the  face  of 
uncertainty  as  to  the  signal  presence.  In  many  situations  such  a sep- 
aration of  costs  is  not  possible  and  the  two  operations  are  said  to  be 


coupled . 


We  can  write  the  observation  equation  for  the  joint  detection- 


estimation  problem  as 

Zk  “ 6 yk  + Vk 


for  k e I 


where  B is  a binary  random  variable  taking  on  the  values  0 or  1 . The 
problem  is  then  referred  to  as  a single-shot  problem,  and  the  same  hy- 
pothesis is  true  over  the  entire  observation  interval.  6 is  called 
the  indicator  variable. 

Let  denote  the  sequence  [z^.z^. . .z^] . Assuming  a quadratic 
cost  for  the  estimation  problem,  let  the  estimate  of  the  signal  y^  be 
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denoted  as  y^  and  let  y^  denote  the  estimate  assuming  hypothesis  to 

aQ  r ■» 

be  true.  It  is  clear  that  y.  * Ely.  / = y 

J k k y. 

k 

By  expanding  in  mixtures,  we  can  then  write  the  estimate  as 

yk  = I yk  p<H±|zk) 

i-0,1 


(1.3) 


If  Pq  and  are  the  prior  probabilities  associated  with  Hq  and 


H^,  define 


P,  p(Zkl«  ) 

\ = P“ 

K *0 


P(zklHo) 


and 


\ 


(1.4) 


(1.5) 


k 1 + A, 


Then  an  alternative  form  of  Eq . (1.3)  is  [8] 


yk  " yk  + ' rk>\ 

This  differs  from  the  form  obtained  by  Middleton  and  Esposito  [4]  in  the 
presence  of  the  second  term. 

It  can  be  further  shown  [11]  that 


rk=E(8|Zk}  (1.7) 

It  can  be  seen  from  Eq.  (1.4)  that  obtaining  the  optimal  estimate  requires 
computation  of  the  likelihood  function  Ak>  The  structure  of  the  optimal 
estimator  consists  of  an  estimator  for  each  hypothesis,  the  outputs  of 
which  are  combined  in  a suitably  weighted  fashion  to  give  the  optimal 
estimate. 

If  the  active  hypothesis  changes  from  observation  to  observation, 
we  have  the  multi-shot  problem.  In  this  case,  8k  can  be  modeled  as  an 
independent  random  sequence  taking  on  the  values  0 and  1.  At  stag-  k, 
the  possible  values  of  8^,  •••  3k  come  from  a set  of  2 binary  seq  ences 
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WT-?' 


of  length  k.  Let  these  sequences  be  denoted  by  , j = 1,2  ...  k. 
Then  we  can  write 


yk  “ £ E{yJZk,0j}  p*a;jlZk*  (1-8) 

t h k 

The  estimator  at  the  k stage  consists  of  a bank  of  2 filters  (one  for 
each  possible  sequence),  whose  outputs  are  then  combined  appropriately. 

It  is  clear  from  Eq.  (1.8)  that  this  leads  to  an  exploding  memory  requir- 
ment . 

It  is  well-known  [12]  that  the  exploding  memory  problem  can  be  re- 
placed by  one  that  requires  storing  complete  density  functions.  The  rele- 
vant densities  may  be  recursively  computed  if  there  exists  a fixed  dimen- 
sional sufficient  statistic. 

Various  suboptimal  schemes  have  been  proposed  to  eliminate  the 

exploding  memory  requirements  of  Eq.  (1.8).  The  linear  estimator  [13] 

does  provide  one  such  scheme  in  which  a single  filter  is  used  to  provide 

the  best  linear  estimator.  Other  schemes  essentially  choose  one  or  a 

k k 

finite  number  of  the  2 possible  sequences  in  some  manner.  This  reduces 
the  summation  in  Eq.  (1.7)  to  a summation  over  a fixed  number  of  terms 
and  hence  leads  to  a fixed  number  of  estimators  in  the  implementation. 

In  this  report,  we  will  investigate  a specific  suboptimal  scheme 
for  estimating  a signal  in  the  presence  of  uncertainty  regarding  its 
reception,  namely,  the  probabilistic  estimation  scheme  [14,15].  Explicit 
algorithms  for  estimation  using  the  scheme  are  developed  and  the  per- 
formance of  the  estimator  under  several  conditions  is  evaluated.  In 


order  to  obtain  explicit  algorithms,  a specific  problem  of  signal  estima- 
tion under  uncertainty,  namely  the  tracking  of  a target  in  a multi-target 


multi-sensor  environment  is  considered.  While  the  algorithms  presented 
in  this  report  have  been  developed  explicitly  for  this  problem,  they 
can  easily  be  extended  to  other  situations  involving  similar  models. 

The  report  is  organized  as  follows.  In  Chapter  II,  the  specific 
problem  considered  in  the  report  is  formulated  and  the  observation 
models  presented.  The  structure  of  the  probabilistic  estimator  for 
this  problem  is  developed  in  Chapter  III  and  its  performance  compared 
with  that  of  two  other  suboptimal  estimators  presented  earlier  in  the 
literature. 

As  indicated  earlier,  in  all  problems  where  there  is  uncertainty 
regarding  the  signal  presence  in  the  observations,  the  two  operations  of 
signal  detection  and  estimation  must  be  carried  out  simultaneously.  Thus 
the  cost  associated  with  these  two  problems  must  be  optimized  jointly. 
However  the  receiver  structures  so  obtained  are  usually  complicated. 

Hence  it  is  desirable  to  compare  their  performance  with  suboptimal  re- 
ceivers to  determine  whether  a significant  improvement  results  from  the 
increased  complexity  of  the  receiver.  Chapter  IV  presents  such  a compari- 
son between  a receiver  obtained  using  the  coupled  risk  formulation  and 
the  scheme.  Chapter  V presents  an  evaluation  of  the  probabilistic  scheme 
in  fading  media  while  Chapter  VI  develops  algorithms  for  estimation  with 
quantized  observations. 
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2.  SIGNAL  AND  OBSERVATION  MODELS 


2.1  Introduction 


The  problem  of  tracking  a target  such  as  a space  vehicle  in  multi- 
target and  multi-sensor  environments  has  received  considerable  attention 
recently  [1-3].  When  observations  from  several  sensors  are  supplied  to 
the  receiver,  it  must  decide  which,  if  any,  of  the  sensor  meaurements  cor- 
responds to  the  target  being  tracked  and  suitably  process  these  measure- 
ments to  update  the  estimate  of  the  target  state.  In  real  time  surveil- 
lance systems,  tracking  filter  performance  heavily  depends  on  two  functions. 
The  first  function  is  referred  to  as  sensor  return-to-track  correlation 
[3-5].  In  the  presence  of  multiple  targets  a sensor  return  could  originate 
from  one  of  three  possible  sources  (hypotheses).  The  return  could  be 
from  the  object  being  tracked,  could  originate  from  one  of  the  other 
objects  not  being  tracked,  or  could  be  noise  alone.  Sensor  return-to- 
track  correlation  is  a decision  process  serving  to  decide  whether  a 
sensor  return  originated  from  the  object  being  tracked.  The  second 
function,  tracking  filtering,  has  been  developed  in  [5].  The  sensor 
return  which  originates  from  the  object  being  tracked  is  used  in 
track  updating;  otherwise,  there  is  no  updating  and  the  one-stage  pre- 
diction is  used.  The  dynamic  equation  of  motion  of  the  target  being 
tracked  is  assumed  to  be  modelled  by  a set  of  Gauss-Markov  difference 
equations 


Xk+1 " Vk  + rk\ 


yk  “ ckxk 


(2.1) 
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where  is  the  n x 1 state  vector,  and  yk  is  a p x 1 output  vector.  The 

m-vector  10  is  white  zero  mean  guassian  with  covariance  E{u>  0)  } = 0 6 ,<b 

K k e k ke  rk 

is  an  n x n state  transition  matrix  and  is  an  n x m excitation  matrix. 

If  a particular  sensor  is  not  locked  onto  the  target  being 
tracked,  the  corresponding  sensor  return  at  any  stage  can  originate 
from  one  of  three  sources  (hypotheses),  namely,  (i)  noise  alone 
(e.g.,  cut-off  communication  link),  (ii)  false  alarm  (e.g.  thermal 
or  process  noisa,  or  clutter  from  a vehicle  not  being  tracked),  and 
(iii)  the  vehicle  being  tracked.  The  receiver  performs  under 
uncertainty  of  reception  in  a ternary  case  which  considers  all  the 
possibilities.  A suitable  model  for  the  observations  under  each 
hypothesis  can  then  be  formulated  as  follows  [1]: 

Under  h\  the  observation  is  given  by 

: zk  - uk  (2-2) 

* where  the  p x 1 vector  u^  is  a zero  mean,  white  Gaussian  sequence 

2 

with  covariance  Etu^u^}  “ 1^6^.  Under  hypothesis  H , the  current 

observation  is  not  uced  in  track  updating  and  the  one-stage  prediction 

is  used  instead.  The  effect  of  the  extraneous  returns  can  be  modeled 

as  a white,  zero  mean  sequence  v,  , with  covariance  E{v.  v } = ft, 6,  . 

k k m k km 

It  is  therefore  reasonable  to  model  the  observation  z^  under  hypothesis 

H2  as 

h£  : zR  - ck  *(k|k-l)  + vk  (2.3) 

where  x (k(k-l)  is  the  one-stage  prediction  of  x^. 

The  expression  for  the  covariance  kernel,  is  given  in  [5] 
and  for  p **  2 simplifies  to 
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(2.4) 


C.V  . .C.T  + R 
n c -k  k [k-1  k k 

k 1 + J * " tf^=F 

VherC  \|k-l  " E{(\  " \|k-l)(xk  “ Xk|k-1)T}*  nTF  is  the  exPccted 
number  of  incorrect  returns,  in  the  one  sigma  region 
T *1/2 

ak  *=  mint(ckvic|R_iCR  + Viii  and  P is  the  correlation  between  two 
measurements. 

The  noise  sequences  u,  , v,  , and  V,  are  mutually  independent 

1C 

and  are  uncorrelated  with  respect  to  the  initial  state 

xq  which  is  multivariate  normally  distributed  of  dimension  n with 

mean  p and  covariance  V , N [p  , V ]. 
o o n o o 

3 

Under  H , the  observation  corresponds  to  the  vehicle  being 
tracked  and  hence  we  have 

"k  ! zk  • Vk  + uk 

It  is  well  known  that  the  optimal  Bayes  estimator  for  this 
problem  involves  an  evergrowing  memory  and  is  hence  not  feasible. 

This  is  because  the  estimate  of  at  each  stage  depends  on  the 
sequence  of  past  true  hypotheses.  This  sequence,  however,  is  un- 
known and  hence  the  estimate  must  be  averaged  over  all  possible 
past  sequences.  This  corresponds  to  the  unsupervised  learning 
problem  in  pattern  classification.  It  is  clear  .that  as  the  number 
of  stages  increases,  the  total  number  of  sequences  over  which  the 
estimates  must  be  averaged  increases  very  rapidly.  We,  therefore, 
need  to  resort  to  suboptimal  schemes  which  use  a fixed  memory. 
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2.2  Error  Rnuntb 


As  indicated  above,  the  case  in  which  the  hypothesis  at  each  stage 
associated  with  any  sensor  return  is  not  known,  corresponds  to 
unsupervised  learning.  If  the  hypotheses  are  known,  we  have  supervised 
learning.  In  general,  if  the  classifications  of  the  sensors  are  not 
known  a priori,  unsupervised  learning  must  be  resorted  to.  In  many 
cases  partial  information  about  the  classifications  can  be  gained  by 
making  use  of  all  the  past  observations  as  well  as  all  the  past  classi- 
fications; e.g.,  by  probabilistic  judgements  to  determine  the  classifi- 
cations; this  is  referred  to  as  partially  supervised  learning, 

Intuitively,  tjie  variance  associated  with  partially  supervised 
learning  is  bounded  below  by  that  of  supervised  learning  (learning  with 
a teacher),  and  bounded  above  by  that  of  unsupervised  learning  (optimal 
Bayesian  learning).  These  bounds  for  the  optimal  stochastic  control 
problem  with  finite  state  space  have  been  investigated  in  [6  ],  and 
tighter  bounds  given  in  [7  ].  Similar  bounds  for  the  optimal  filtering 
problem  can  be  established  by  augmenting  the  message  model  of  Eq.  (1) 
such  that 


*Li = *kxi + VA 


xk+i  - f(xk>\) 


(2.6) 

x£  E {0,1}  V k (2.7) 

where  Eq.  (2.6)  can  be  identified  with  Eq.  (2.1)  by  setting  x.^  * x,  and 

K u 

1 2 
OJ^  ■ Eq.  (2.7)  is  the  model  for  switching  hypotheses,  is  white 

zero  mean  gaussian  sequence  uncorrelated  with  U)^  as  well  as  the  initial 
1 2 

state,  and  x^  and  x^  are  independent. 
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(2.8) 


The  corresponding  observation  model  is 

. , 1 2. 

Zk  = h(xk’V  + ck 


which  can  be  identified  with  Eqs.  (2.2)  - (2.4)  by  setting 

1 

U • X.  *=  0 anH  ( _ 

'k  vk 


2 

*k 


1 

Xk"  ° 


if  H.  is  active 
k 


2 

*k 


, i.e.  h[x^,x^]  = 0 
(2.9) 


1 * 011(1  ek  = \ 


if  is  active  (2.10) 


and 


1 

*k"*k 


2 

"k-1 


and  e,  «*  v, 
k k 
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i.e.  hfxR,xk]  = xkCkxR  + vR  if  1^  is  active  (2.11) 

Let  V^o^)  and  V^o^)  denote  the  error  variances  corresponding 

to  the  supervised  estimator  (optimal  Bayesian  estimator  with  known 
classification  of  sensor  returns) , the  unsupervised  estimator 
(optimal  Bayesian  estimator  with  unclassified  sensor  returns)  and  par- 
tially supervised  estimator.  Then,  following  an  argument  similar  to 
[10],  we  can  show  that 

V®(fci(k))  < v£(w(k>)  < v”(w(k))  V k (2.1.2) 


That  is,  the  error  variance  associated  with  the  partially  supervised 
estimator  is  bounded  below  by  that  of  the  optimal  Bayesian  estimator  with 
known  classifications  and  above  by  the  estimator  with  unknown  classifications 

The  proof  of  Eq.  (2.12)  is  by  induction  and  is  as  follows. 

At  the  final  stage  N,  the  optimal  estimate  3^  and  the  correspond- 
ing optimal  return  Mn))  are  evaluated,  respectively,  as 

■ conditional  mean  of  f(xN|ZN)  (2.13) 
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f 


and 


V^(<j(N))  = conditional  variance  of  f(x^|ZN) 


where  u(N)  = f(xN|ZN) 


Hence,  at  the  final  stage  N 
mil 

A 

*N 


(2.14) 


min  E C(x^,xN>N)  = E min  E Ctx^.x^.N) 

ZN  *N  |ZN 

*=  E Vn(uj(N))  (2.15) 

ZN 

where  E denotes  the  mathematical  expectation  with  respect  to  the 

hi 

distribution  of  Z , and  E denotes  the  mathematical  expectation  with 

I ^ 

respect  to  the  conditional  distribution  of  x^  given  Z^. 

Assume  that  the  following  equality  holds  at  stage  k+1 
N 

min  E £ C(x^,x^,i)  = E Vk+^  (oj(k+l) ) ^2  jg) 

W*n  i=k+1  Vl 


Then,  at  stage  k,  application  of  the  principle  of  optimality  yields 

n E l C(x  ,x  ,i)  = min^E  c(*k«xk»k)  + E Vk+1  (u>(k+l) ) 1 

•••x™  1=1  x,.  1 Z,..,  } 


min 

V,,XN 


min|E  E C(xk,xk,k)  + E E 

^ Zk  lZk 


WA 


E min^E  C(x,  ,x  ,k)  + E 


\+l  I \ 


k+1 


Vi 

k+1(u>(k+l))} 

(u)(k+l))| 


E V.  r to(k) ) 


where 


Vv(w(k))  *=  min|E  C(xk>xk,k)  + E 


k+1 


k+1 


(u)(k+l))| 


(2.17) 


l 
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Equations  (2 . 15)- (2. 17)  hold  equally  for  all  the  three  types  of  supervision 
and  corresponding  expressions  are  obtained  by  adding  superscripts  on  the 
optimal  returns. 

It  can  easily  be  shown  that  for  the  cost  function  under 
consideration  the  following  inequalities  hold 


C(VVN)  - min  C(VVN)  - m±n  C(VVN> 


(2.18) 


Since  operations  such  as  minimization  and  conditional  expectation  do 
not  change  the  inequalities  , 


min  E C^.x^N)  > min  E min  CCx^x^N)  > min  E min  C($  ,x  ,N) 

*»  'ZN  |ZN  sjj  lZN  *N 


(2.19) 


which  can  be  reduced  to 


min  E 


E C(xn,xn,N)  > min  E min  C(xN>xN,N)  > E mir.  C(S  ,x  ,N) 

■ • *»  ^ < '2»  4 'Zn  *»  (2.20) 

Utilizing  the  preceding  inequalities  and  making  use  of  the  independence 

of  x^  and  x^,  Eq.  (2.15)  yields 

v£(w(N))  = min  ^ C(xn,xn>N) 

*N  N 

> min  E min  C(x  ,x^N) 

*1  |Z„  ~2 

. min  E 

I Zv, 

*N  N 

- V®(oi(N))  (2.21) 

Assume  that  the  inequality  holds  at  stage  k+1,  i.e. 

V^(w(k))  >.  V*(w(k))  V k (2.22) 
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f 


At  stage  k,  from  Eq.  (2.17) 


v£(u»(k))  = min/E  C^.x^k)  + E v£+iMk+1>)} 
*k  l|zk 


xk  I “k+l'^k 

> min/E  C(x  ,x  ,k)  + E V®  (aj(k+l))\ 

' \'|zk  VllZk  1 


vj;(u(k)) 


(2.23) 


To  establish  the  lower  bound,  it  is  noted  that  Equation  (2.15)  yields 
V*J(u>(N))  - min  E C(^,x  ,N) 

- min  E C(jcJ,-xJ,N)  + min  E C(x^,x^,N) 

*i  |z»  4 |z» 

£ min  E C(x^,x^,N)  + min  E C(x^,x^,N)  (2.24) 

\ |2»  *ii 

where  C^.x  ,N)  = C(xJ,xJ,N)  + C(x^,x^,N)  because  of  the  quadratic  nature 

1 2 

of  the  cost  function  C(j^,xn,N)  and  the  independence  of  xN  and  x^ 


Following  the  same  reasoning  as  in  Eq.  (2.23)  one  can  show 
vP(w(k))  < Vk  (cj(k)j  V k 


(2.25) 


In  subsequent  chapters, we  will  consider  a specific  partially  supervised 
estimator,  namely  the  probablistic  estimator  and  derive  explicit  algorithms 
for  its  implementation  under  various  conditions. 
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3.  THE  PROBABILISTIC  ESTIMATOR 


3.1  Introduction 


As  discussed  in  Chapter  I,  the  optimal  Bayesian  estimator  for 
tracking  a target  in  a multi-target,  multi-sensor  environment  is  not 
feasible  because  of  its  exploding  computational  requirements.  In 
this  chapter,  we  will  present  a suboptimal  scheme,  namely  the  probabilistic 
estimator  which  uses  a probabilistic  judgement  at  each  stage  of  the 
observations  to  determine  the  true  hypothesis.  The  scheme  is  evaluated 
by  comparing  its  performance  with  two  other  suboptimal  schemes  which 
have  been  presented  in  the  literature. 

As  explained  in  Chapter  II,  the  dynamic  equation  of  motion  of  the 
target  being  tracked  is  assumed  to  be  modelled  by  the  set  of  Gauss- 
Markov  difference  equations 

Xk+1  = Vk  + rk\ 


yv  = ckxk 

where  x^  is  the  n x 1 state  vector,  and  y^  is  a p x 1 output  vector. 

The  m-vector  is  white  zero  mean  guassian  with  covariance  E }=Q^6^e , 

is  an  nxn  state  transition  matrix  and  T.  is  an  nxm  excitation  matrix. 

k 

The  observations  at  any  stage  can  originate  from  one  of  three 
sources  (hypotheses),  namely,  (i)  noise  alone  (e.g.,  cut-off  communication 
link),  (ii)  false  alarm  (e.g.  thermal  or  process  noise,  or  clutter  from 
a vehicle  not  being  tracked),  and  (iii)  the  vehicle  being  tracked.  The 

observations  are  thus  given  by  [1,2] 

H?  : z « u (3.2) 

k k k 

where  the  p x 1 vector  u^  is  a zero  mean,  white  Gaussian  sequence  with 
covariance  Efu^u^}  = 


(3.3) 


= c^  x(k|k-l)  + 


where  x(k|k-l)  is  the  one-stage  prediction  of  x,  and  V,  is  a white. 

k k ’ 

zero  mean  sequence  with  covariance  E(v,  v } = ft  6 

km  k km 


Hk  = Zk  = CkXk  + Uk 


(3.  A) 


The  noise  sequences  v^>  and  are  mutually  independent  and 
are  uncorrelated  with  respect  to  the  initial  state  xQ  which  is  multivariate 
normally  distributed  of  dimension  n with  mean  and  covariance  V^, 

N„  ‘V  V- 


It  is  well  known  that  the  optimal  Bayes  estimator  for  this 
problem  involves  an  evergrowing  memory  and  is  hence  not  feasible. 

This  is  because  the  estimate  of  x^  at  each  stage  depends  on  the 
sequence  of  past  true  hypotheses.  This  sequence,  however,  is  un- 
known and  hence  the  estimate  must  be  averaged  over  all  possible 
past  sequences.  This  corresponds  to  the  unsupervised  learning 
problems  in  pattern  classification.  It  is  clear  that  as  the  number 
of  stages  increases,  the  total  number  of  sequences  over  which  the 
estimates  must  be  averaged  increases  very  rapidly.  We,  therefore, 
need  to  resort  to  suboptimal  schemes  which  use  a fixed  memory. 

There  are  basically  two  approaches  which  can  be  taken  to  solve  this 
problem.  We  can  use  a fixed  number,  say  K,  of  past  observations  in 
arriving  at  our  estimate.  Alternately,  we  can  use  all  past  obser- 
vations in  our  estimation  scheme  but  choose  only  one  (or  more  generally, 
a fixed  number  N)  of  possible  past  sequences  over  which  to  average. 
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A few  such  schemes  have  been  discussed  in  the  literature.  A statistical 

test  Is  used  in  [3,4]  to  eliminate  the  likely  extraneous  returns. 

The  number  of  sensor  returns  can  be  further  reduced  by  combining 
identical  observations  from  the  most  recent  N stages  [1].  This 

I 

is  essentially  an  estimator  based  on  an  infinite  sequence  of  hypo- 
theses and  a finite  sequence  of  observations.  An  alternative  has 
been  studied  in  [5],  which  is  essentially  an  estimator  based  on  an 
infinite  sequence  of  observations  and  a finite  sequence  of  hypotheses. 

Either  of  them  requires  a tremendous  amount  of  computation.  We  now 

present  the  structure  of  a probabilistically  supervised  estimator  which 

uses  a probabilistically  supervised  learning  sequence  to  determine  the  hypothesis 

corresponding  to  each  sensor  return  and  thus  eliminate  extraneous  returns. 

The  estimator  is  compared  with  two  other  suboptimal  estimators  in  terms 
of  performance  and  computational  requirements.  Simulation  results  are 
presented  to  show  the  applicability  of  the  probabilistically  supervised 
estimator. 

3.2  Probabilistically  Supervised  Estimator 

In  generating  a probabilistically  supervised  learning  sequence, 

I 

one  has  to  design  a probabilistic  judgement  at  each  stage  of  observations, 
on  the  sensor  returns  [6],  This  Judgement  is  carried  out  in  parallel 
on  each  sensor  return,  in  such  a way  that  each  return  is  assigned  a 
classification  label  which  then  serves  as  its  true  hypothesis.  We  can 
think  of  the  scheme  as  consisting  of  a set  of  parallel  processing  units 
(PPU),one  corresponding  to  each  of  the  sensors, and  a central  processing 
unit  (CPU)  which  combines  the  processed  returns  from  the  PPU's  to  update 
the  estimate  of  the  target  track.  On  receiving  the  observations,  each 
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PPU  performs  a random  labelling  test  to  generate  a label  corresponding 

to  each  of  the  three  hypotheses.  The  label  is  then  accepted  as  the  true 

hypothesis  at  that  stage.  Depending  on  the  label,  the  observation  from 

the  i sensor  is  modeled  according  to  Eqs.  (3.2)-(3. 4)by  the  corresponding 

PPU.  The  algorithm  for  combining  the  observation  models  supplied  to  obtain 

the  track  estimate  is  discussed  later.  < 

We  can  now  consider  the  generation  of  the  random  labels. 

Suppose  there  are  M sensors  and  that  at  stage  k,  there  are  n^  returns, 

< H.  Let  2*  denote  the  label  generated  by  the  ith  PPU  at  stage  k 

and  let  L(k)  be  the  vector 

1 7 nk  T 

L(k)  " \ ■ ■ • V1 

M 

Let  l£  denote  the  set  of  all  past  labels  L(j),  j = 1 . . . k. 

Similarly  let  z*  denote  the  i sensor  return  at  stage  k and 


let 


T T n, 

Z(k)  = [z*  z£  • • zk 


M 

Let  denote  the  set  of  all  past  observations  Z(j),  j=l...k. 

Since  there  are  three  hypotheses,  the  label  is  a ternary  random 

variable.  When  a new  measurement  z*  + ^ is  received,  the  PPU  updates 

the  probabilities  associated  with  the  three  outcomes  on  the  basis  of 
■ 

z , the  past  measurement  sequence  Z.  and  the  past  labelling  se- 

k + 1 k 

M 

quence  L^.  We  can  then  write  for  these  probablities 

£<i  ■ "inivi- z"' 


4 L">'f(lltrHkn|2k'  L"> 


(3.5) 


f(4ll<‘  L"> 
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where 


£<vi'z"’  L">  £<vX+i-hm-  <•  $X+X-  l">  «-6> 

for  i *=  1,  2 . . . n^;  j = 1,  2,  3 

If  we  assume  that  the  hypotheses  are  independent  from  stage  to  stage, 
we  can  write 


$ ■ X+i  -Li> 


For  Markov-dependent  hypotheses,  we  have 

f lZW»Ll,>  = F (\+l=Hk+l  I \=V 


(3.7) 


(3.8) 


vk+l  “kHl“k’*V 

It  only  remains  to  determine  f (zk+] I ^k+1  =^k+l*  ^k’  ^k^ " From  t^ie  observation 
model,  Eqs.  (3.1)-(3.3),  we  can  write 


£(vi|tLrHk+i'zk'L”)  ■ V^Vi1 
- V WW  I k°$  ,nk+l  ak)  1 


for  j=l 
for  j=2 


Np  ^Ck+lXk+l  | k ( Lk)  ’ Ck+lVk+l  | k ^Lk^  Ck+1+Rk+1  ^ 

for  j=3 


Define 


(3.9a) 

(3.9b) 

(3.9c) 
(3,.  10) 


i * 1 r 1 | T M M, 

Yk*l  “ Zk+l"E{zk+llLk’  “k} 

which  is  a white  sequence. 

M M 

Let  y be  a collection  of  the  white  sequences 

associated  with  all  the  sensor  returns. 

It  then  follows  that  the  equations  for  the  sequential  estimation  of  the 


state  are  given  by 

. M. 


M. 


\+l|k(Lk)  “ ^kXk(Lk) 


(3.11) 


21 
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Vk4.l|k(Lk>  - Vk(LX  +Wk 

VlO  - *k+l  Ik  (Lk)+1k+i(Lk+l) Yk+1  (Lk+l} 


\+l^Lk+l^  = ^I"\+l(Lk+l)Ck+l^Vk+l|K(Lk) 


where 


_M  .1 

*k+i  " diag[Rk+r 


<+i  1 


Eqs.  (3.5)— (3. 16)  constitute  a complete  set  of  algorithms  for  updating 


the  estimate. 


3.3  Evaluation  of  Probabilistic  Estimator 


We  will  now  briefly  describe  two  suboptimal  estimator  which 
have  been  discussed  in  the  literature  for  purposes  for  comparison 
with  the  probabilistic  estimator.  In  the  first  scheme,  the  infinite 
sequence  of  hypotheses  is  approximated  by  a sequence  of  fixed  size. 

This  scheme  is  referred  to  as  suboptimal  estimator  of  finite  memory. 

The  other  scheme  is  to  eliminate  the  likely  extraneous  returns  by  constructing 
a gate  test.  This  is  referred  to  as  a suboptimal  estimator  with  gating. 

3.3.1  Suboptimal  Estimator  of  Finite  Memory 

The  conditional  density  of  one-stage  predictor  f (xk+1 I ak+1 »zk) 
depends  on  all  the  past  sequences  and  requires  an  evergrowing 

amount  of  core  memory  for  its  implementation.  The  sequence  a^Cot^,  • • • ,a^) 


1 ' I 


could  be  approximated  by  a subsequence  b^  taking  the  most  recent  N 

elements,  bk=(a^, • • • »ak-N+P • As  the  memory  size  is  restricted  to  N,  the 

earliest  element  a,  , , is  not  used  in  evaluating  the  conditional 
k-N+1 

■ M 

density  of  one-stage  predictor.  In  other  words,  f (x^+^ | ai^+i , Z^)  is  an 

/v  s s 

approximated  gaussian  with  mean  x^+.  (b^)  and  variance  ^+1 1 ’ where 

bk  " (°tk»***»ak-N+2^* 


As  a consequence  of  the  approximation,  the  following  equations 

hold 


f (xk+llbk+l,Zk+lJ  = Nnfxk+l|k(bk),Vk+l|k(bk)J 

lf  “k+l  = Hk+1 

. (3.17a) 

" Nn  ^Xk+1 1 k (bk^ ’ Vk+1 1 k (bk) 

lf  “k+l  = Hk+1 

(3.17b) 

" Nn[xk+l(bk+l>'Vk+l(bk+l)] 

lf  “k+l  = Hk+1 

.(3. 17c) 

' ♦A(bk> 


Vi|k(bk>  ■ Wb>l + W 


M. 


*k<bP 


E wvvv 

Sv  “k-N+l 


£ f (bvlz?) 


“k-N+l 


w'"k'T 


1 WW# 


v.  (b‘>  - 77- 

“ “ * fv<»kl7k> 


“k-N+l 


(3.18) 

(3.19) 


(3.20) 


(3.21) 
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jjV, 


f 


where 


W*W  • sk+i|k(bP  + Kk+i(bk+i> lak+rck+isk+i|k(bk)  1 


M „M 


WW  ‘ Vk+l(bk+l)(Ck+l)T<Cl>'1 

(3.22) 

(3.23) 

WW  ' " - WVl>Ck+11Vk«|k(bk> 

(3.24) 

5,1+1  " a "t  Vl<bk+l>-f„(bk+llzk+i> 

Vl  k-N+2 

(3.25) 

Vi  ■ 1 -l  Vi(bk+i>-f„<bk+ilzk+i> 

Tc+1  >N+2 

(3.26) 

The  weighting  coefficients  arc  updated  by 


fw*bk+JZk+l^ 


k+i'^k+ry  ^pk+i 1 V 

, , M I 

f (zk+llZk} 


f(zk+llbk+l»Zk)  N£P[0,Rk+l] 

lf  “k+1  = Hk+1 

- V&iW(b^Wb^ 
if  Vi  = ii 

_M  ~ sx  „M 


(3.27) 


(3.28) 


(3.29) 


lf 

k+1  k+1 


N£p [ Ck+lXk+l |k(bk} * Ck+lVk+l | k (bk} (Ck+l ) T + Rk+1 ] 
,3 

(3.30) 


where  £ denotes  the  number  of  sensor  returns  at  stage  k+1,  and 
£ could  vary  from  stage  to  stage. 

•••*  jOWk>vbk+X> 

\+l  \-N+2 

fv(bk+llZk>  ■ WX-#’1  f„<bX> 


(3.31) 


(3.32) 


“k-N+l 


2A 


f 


i s M 

fw(a^+1|b^#Zk)  ■=  for  independent  hypotheses  (3.33a) 

*=  P(a^+^ja^)  for  markov-dependent  ^ ^3^) 

hypotheses 

It  is  noted  that  the  suboptimal  estimator  of  memory  size  N 
is  identical  to  the  optimal  estimator  under  a single  return  and  the 
suboptimal  estimator  deviates  from  the  optimal  estimator  for  k > N. 

The  sufficient  statistics  at  the  transition  stage  k = N are  x^Cb^)  and 
V^(bjj),  which  are  not  valid  for  evaluating  the  density  of  the  one-stage 
predictor.  Because  of  the  restriction  to  most  recent  N stages,  the 
stage  k-N+1  cannot  be  used  in  evaluating  the  density  of  the  one-stage 
predictor.  This  can  be  overcome  by  simply  averaging  x (b,  ) and  V (b  ) 

K K K.  K. 

with  respect  to  the  weighting  coefficients  at  stage  k-N+1 

\<bP  - I VV  £„<bk|z") 

“k-N+1 


and 


Vbk>  - 1 W £„<bJ# 

“k-N+1 


3.3.2  Suboptimal  Estimator  with  Gating 
In  this  scheme,  only  those  sensor  returns  which  pass  a certain 
gate  test  [3,5]  are  considered  for  updating  a particular  track.  Note 
that  the  innovations  [7]  corresponding  to  the  correct  return  at  stage  k 
is  approximately  normally  distributed  with  zero  mean  and  variance 


Si^k  k-l^k  + Rk*  weH  known  that 


II  " 


:kXk|k-l 


(CkVk|k-lCk  + V 


-1 


(3.34) 
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has  a p degrees  of  freedom  chi-square  distribution.  The  gate  test  is 
to  select  the  threshold  in  such  sensor  return  z*  for  which 

the  quantity  in  Eq.  (3.34)  falls  in  the  right  tail  is  rejected  in  order  to 
eliminate  the  extraneous  returns.  Hence  the  number  of  validated  returns 

could  be  limited  and  utilized  in  track  updating. 

It  is  assumed  that  there  is  a nonzero  probability  of  incorrect  correlation. 


Let  the  set  of  validated  returns  at  stage  k+1  be 
M t 1 1 , 

Vl  " {zk+l’***’Zk+l} 

and  the  accumulated  validated  returns  up  to  stage  k+1  be 


(3.35) 


M r M M-> 

*k+i  ' {2k+i-  •I1> 


(3.36) 


Define  at  each  stage,  the  events 

Cl  «=  {proper  return  is  detected  but  it  is  not  correctly 
correlated]  U {proper  return  is  not  detected] 

and 

■»  {z*+^  correctly  correlated  at  stage  k+1 ] 
where  i «=  1,2, ••*,4;  and  k ■>  1,2,«**  . Assume  that  are  mutually 

exclusive  and  equally  likely  events,  and  that  {zj^j, zk+lJ  are 

independent  and  identically  distributed.  The  probability  of  event  A^+^ 
is,  for  simplicity,  assumed  constant  and  known  a priori. 


The  minimum  variance  estimate  is 

Xk+1  “ | Xk+lf  (xk+JZk+l)dxk+l 
ft 

" JL  j Xk+lf  ^Xk+1  ^ Zk+1  ’ Ak+l^dxk+l  * fw^k+l  ^ Zk+1^ 

i^O  ' 


" E xk+l(Ak+l)*fw(Ak+JZk+l) 
i^O 


(3.37) 


f 
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The  variance  associated  with  the  above  estimate  is  obtained  as  follows 

V, 


k+1 


(xk+l"xk+l) (xk+l~xk+l)  f (xk+l I Zk+l)dxk+l 


n 


^ Vk+1 (Ak+l * fw (Ak+l I Zk+1) 


,M 


ic0 

£ 


+ JQ  \+l(Ak+l)xk+l(Ak+l)‘fw(Ak+llZk+l)  “ xk+lXk+J. 

It  is  to  be  noted  that  the  variance  is  the  sum  of  weighted  variance 
plus  a positive  semidefinite  matrix  to  incorporate  the  effect  of  the 
extraneous  measurements  [3]. 

The  conditional  a posteriori  density  is  calculated  as 

{(zk+l I xk+l ’ Ak+1 ' ' Zk) * f (xk+l I Ak+1 . ’V 


(3.38) 


f(xk+JAk+l,Zk+l) 


k+1 1 k+1*  k+1*  k 
c/  M i.i  „M. 
f (zk+l'Ak+l’ V 


(3.39) 


The  above  expression  can  be  simplified  by  making  use  of  the 

Ai 

on  as 


assumption 


f ^Xk+llAk+l,Zk+l) 


f ( 2k+l  I Xk+1  * Ak+1  * zk}  f K+iK+rfo 


^4+llAk+l*Zk) 


(3.40) 


In  evaluating  Eq.  (3. 40), the  following  should  be  noted: 

1.  The  conditional  density  of  one-stage  predictor  is  independent 

°f  Ak+1’  and  18  GqUal  t0  ^^k+lIk’VlIk1* 

2-  f(*k+iixk+i,Ak+i,zk)  - ywwVi1  for  1 " 

and  either  Np[0,Rk+1l  or  Np t ck+1«k+1 1 R . nk+1 1 for  i=0* 

It  is  clear  from  the  above  that  the  conditional  a posteriori  density  can 

therefore  be  written  as: 
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£ ^Xk+llAk+l,Zk+l^  " Nn^Xk+l(Ak+l),Vk+l^Ak+l^ 


for  i *=  1,2, • • • , & 

“ Nnl*k+l|k,Vk+l|J 
for  i=0 

together  with  the  following  equations  for  updating  the  sufficient 
statistics, 

xk+l(Ak+l^  = Xk+1 | k + Kk+l(Ak+l) ^Zk+l_Ck+lXk+l|k^ 

Kk+l(Ak+l)  = Vk+l|kCk+llCk+lVk+l|kCk+l  + ^+1^ 

Vk+l(Ak+l)  [I  Kk+l(Ak+l)Ck+l]Vk+l|k 
As  expected  the  gain  K^+^(A^+^)  is  independent  of  A*+^  and  is 

the  same  for  each  validated  return. 

The  weighting  coefficients  are  calculated  as  follows: 


Since  f (A* I z“  ) 

w'  k+1 1 k+1 


f ^k+l^+l^k* 

fO<> 

f ^k+rVu^V 

f<Z£+X> 


.+llzk+l’Zk)’i=1’2,  **'»* 


It  follows  from  the  preceding  that 

fv(Ak+l I *k+l ’ Zk)  = fw(Ak+llZk+l,Zk) 


1 “ P^Ak+llZk+l) 


0 M 

where  | Z^+^)  is  known  a priori  [3]. 


Equations  (3.37),  (3.38),  and  (3.41)-(3.47)  constitute  the  necessary 
set  of  equations  for  recursively  updating  the  sufficient  statistics. 
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3.4  Simulations 

In  order  to  evaluate  the  performance  of  the  probabilistically 
supervised  estimator  and  to  compare  it  with  the  other  estimators,  the 
same  tracking  example  given  in  [1]  was  simulated.  The  parameters  of  the 
aircraft  being  tracked  are  given  as  follows 

x - position  at  stage  k 
x - speed  at  stage  k 
y - position  at  stage  k 
y - speed  at  stage  k ] 


change  in  x-speed  between  stage  k and  stage  k+1 
change  in  y-speed  between  stage  k and  stage  k+1 


(2x1) 


♦,  “ 


1 

0 

0 

0 

0 

1 

0 

0 


T 0 

1 0 

0 1 

0 0 

0 
0 
0 
1 


0 

0 

T 

1 


(4x4) 


and 


’a00 


0 

q2200 


(4x2) 

where  T is  the  sampling  interval 


V 


(2x2) 


10  0 0 

Yn00  0 

0 0 10 

(2x4) 

o Y2200 

(2x2) 
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For  simplicity,  the  following  values  are  used 


T - 2 

p ■=  0 

nTF  = °*01 

Xq,  multivariate  normally  distributed  with  dimension  four. 


na[o,g.oi]. 

Q - diag[0,0] 


n.  diag  [0.00828,  0.00828] 

Simulation  results  are  presented  in  the  form  of  tables  for  the  mean  square 

error  associated  with  the  x-position.  Similar  results  were 

also  obtained  for  the  other  states. 

The  simulations  consist  of  a comparison  between  the  three  estimators 

presented  in  this  chapter,  namely  Suboptimal  Estimator  with  Finite  Memory 

(SFE) , Suboptimal  Estimator  with  Grating  (SGE)  and  the  Probabilistically 

Supervised  Estimator  (PSE) . One  hundred  simulations  under  a single  return 

were  averaged  to  obtain  the  mean  square  errors.  The  probability  of  the 
3 12 

hypotheses  were  POi^)  = 1,  PCH^)  “ P(H£)*0.  The  results  are  shown  in  Table  $.1 

which  t Iso  shows  the  mean  square  error  for  the  optimal  Bayesian  Estimator 

(OBE) . For  the  simulations,  an  infinite  gate  size,  unity  detection. probability 
0 M 

of  sensor  and  P (Aj^  I zic+^)  " 0 were  assigned  to  SGE  and  a memory  length  of 
two  stages  to  SFE.  The  simulations  were  carried  to  only  five  stages  be- 
cause of  the  computational  requirements  associated  with  OBE.  The  performance 
of  all  the  estimators  is  equally  acceptable  under  these  conditions. 
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12  3 

Table  3.2  shows  the  results  for  the  case  P(H^)  = PCH^)  “ 0-1  an^  P(H^)=0.8. 
The  decrease  in  MSE  for  the  PSE  is  not  as  fast  as  with  the  OBE  and  SFE. 

However,  as  shown  in  Table  3.3  the  PSE  has  decided  advantages  in  terms  of  com- 
putation requirements. 

The  second  part  of  the  simulations  consist  of  a comparison  between  PSE 
and  SFE.  A one-hundred-step  process  under  five  ground  sensors  was  simulated 
on  the  same  model.  Ten  simulations  were  averaged  to  obtain  the  mean  square 
errors  and  varinaces.  In  Tables  3. 4, 3. 5 and  3. 6, it  is  shown  that  the  performance 
of  PSE  is  better  than  that  of  SFE  in  all  the  possible  cases.  This  fact 
bedomes  more  pronounced  in  the  most  uncertain  environments  in  which  P(H^)“0.2, 
P(hJ)-P(H^)*  0,4. 

For  the  case  of  dependent  hypotheses  transition  probabilities 

P(Hk+lIHk)  Wer<:  chosen  as 


j - 1 


i - 1 
2 
3 


0.8 

0.1 

0.1 


2 

0.1 

0.8 

0.1 


3 

0.1 

0.1 

0.8 


The  process  was  started  initially  with  equally  likely  probabilities 
of  hypotheses.  The  results  shown  in  Table  3.7  indicate  that  the  performance 
of  PSE  is  quite  acceptable  in  this  case. 
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TABLE  3.1 


MSE  of  PSE,  OBE,  SFE,  and  SGE;  P(H^)  « 1 


Stage 

PSE 

OBE 

SFE 

SGE 

1 

.03300 

EH 

.02675 

2 

.01790 

.02109 

11119 

.02331 

3 

.03262 

.03301 

.03301 

.03466 

4 

.03333 

.03202 

.03202 

.03768 

5 

.03079 

.03107 

.03107 

.03195 

TABLE 

3.2 

MSE  of  PSE,  OBE,  SFE,  and  SGE;  P(H*)-P(h£)«0.1 

and  P(H^)=0. 8 


I 

Stage 

PSE 

OBE 

SFE 

SGE 

1 

.34901 

.20144 

.20144 

.17598 

2 

.13036 

.12237 

.22925 

3 

.18304 

.11934 

E 3 

.26749 

4 

.21962 

.08673 

.11383 

.28088 

5 

i 

.14741 

.07520 

.09133 

.47075 

TABLE  3.3 

Percentage  of  Time  and  Core  Locations 


Time1 

Core  2 
Locations 

PSE 

6 

48 

OBE 

100 

100 

SFE 

16 

65 

SGE 

6 

59 

1.  Percentage  of  the  computer  time  required  by  the 
optimal  filter. 

2.  Percentage  of  the  core  locations  required  by  the 
optimal  filter. 


. I 
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TABLE  3.4 

MSE  of  PSE  and  SFE;  p(H^)  = 1 


STAGE 

PSE 

SFE 

10 

.00153 

.01967 

20 

.00101 

,00589 

30 

.00022 

.00294 

40 

.00025 

.00196 

50 

.00019 

.00136 

60 

.00017 

.00101 

70 

.00016 

.00077 

80 

.00019 

.00058 

90 

.00014 

.00042 

100 

.00010 

.00033 

-TABLE  3.5 

MSE  of  PSE  and  SFE;  p(hJ)  = p(H^)  = 0.1  and  p(H^)  = 0.8 


STAGE 

PSE 

SFE 

10 

.02071 

.01439 

20 

.00283 

.02314 

30 

.00159 

.01425 

40 

.00065 

.02270 

50 

.00037 

.03167 

60 

.00025 

.05524 

70 

.00021 

.07709 

80 

.00022 

.07149 

90 

.00019 

.09386 

100 

.00012 

.10536 
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TABLE  3.6 


MSE 


of  PSE  and  SFE;  p(H^)  *>  0.2  and  p(H^)  = p(H^)  = 0.4 


STAGE 


PSE 


10 

.69687 

.10443 

20 

1.13143 

.09342 

30 

.45878 

.08765 

40 

.37909 

.19294 

50 

.32131 

.26892 

60 

.22679 

.48263 

70 

.19707 

.59716 

80 

.16257 

.75582 

90 

.11455 

.87802 

100 

.10383 

.98750 

SI 


TABLE  3.7 

MSE  of  PSE  and  SFE;  Markov-dependent  hypotheses 


STAGE 

PSE 

SFE 

10 

.13949 

1.44858 

20 

.09721 

1 

1.37159 

30 

.10936 

1.06421 

40 

.15072 

.76816 

50 

.23789 

.74486 

60 

.25945 

.89461 

70 

.28760 

1.14302 

80 

.30237 

1.09317 

90 

.34798 

1.30457 

100 

.39394 

1.97090 
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4.  COUPLED  RISK  FORMULATION 


4.1  Introduction 

It  has  been  pointed  out  earlier,  that  In  all  problems  involving 
uncertainty  of  signal  reception,  the  operations  of  detection  and  estimation 
are  intimately  coupled.  The  optimal  processing  of  sensor  returns  requires 
the  mutual  coupling  of  the  costs  associated  with  both  operations.  Signal 
estimation  is  strongly  influenced  by  the  detector's  outcome.  A Bayes' 
optimum  theory  of  Joint  detection  and  estimation  of  signals  in  noise 
has  been  studied  in  [1-4]  by  using  a generalized  cost  function  which 
reflects  the  mutual  coupling  between  the  joint  operations. 

An  estimator  based  on  minimizing  such  a coupled  risk  may  perform 
better  than  the  estimator  in  which  the  two  operations  are  treated  indepen- 
dently. This  improvement  in  performance,  if  any,  will  however  be  at  the 
cost  of  added  complexity  in  the  receiver.  In  this  chapter  a simultaneous 
detection  and  estimation  scheme  is  developed  for  situations  where  the 
signal  evolves  as  a stochastic  process  and  uncertainty  characterizing  the 
presence  of  the  process  in  the  observation  varies  in  the  fashion  discussed 
in  Chapter  2.  Since  the  optimal  processor  again  requires  an  ever-growing 
memory,  a finite  memory  suboptimal  scheme  which  minimizes  the  coupled  risk 
is  developed.  The  performance  of  this  suboptimal  processor  is  compared 
with  that  of  the  probabilistic  estimator  introduced  in  the  previous  chapter. 

4.2  Coupled  Risk  Formulation 

There  are  two  different  kinds  of  risks  associated  with  joint  detection 
and  estimation:  uncoupled  risk  and  coupled  risk.  In  the  uncoupled  risk 

case  the  minimization  of  compound  risk  is  equivalent  to  the  minimization 
of  the  individual  risks  associated  with  the  detection  and  estimation  pro- 
blems respectively.  The  minimization  of  the  decision  risk  leads  to  the 
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familiar  likelihood  ratio  test  [ 1 ) * The  minimization  of  estimation  risk, 

however,  yields  the  problem  in  vhich  the  estimator's  performance  is  to  be 
optimized  under  uncertainty  [5,6].  in  C0Upied  risk  case  the  joint  oper- 
ations of  detection  and  estimation  should  be  coupled  and  the  costs  associated 
with  these  two  operations  chosen  to  reflect  the  interaction  between  them. 

Hence,  the  total  risk  of  joint  detection  and  estimation  can  not  be  expressed 
as  the  sum  of  two  independent  components.  A simultaneous  minimization  of 
the  total  risk  with  respect  to  the  decision  and  the  estimation  rules  is 
necessary.  Joint  detection  and  estimation  of  coupled  risk  has  been  studied 
in  [13,15]  under  a single  return.  This  section  is  devoted  to  the  formula- 
tion of  the  problem  of  optimizing  a coupled  risk  function  under  multi-sensor  returns 
To  reflect  the  coupling  between  detection  and  estimation,  let 
Ci4(xk>xk)  be  defined  as  the  cost  of  choosing  the  hypothesis  nk  and  making  an 
estimate  xk  when,  in  fact,  the  hypothesis  nk  is  true  and  xfc  is  the  true  value 
of  the  signal.  In  other  words,  the  cost  at  stage  k,  can  be  ex- 

pressed as 


VvV  = Lij+Fij'Nv\ll' 


(4.i) 


where  L..  and  F are  chosen  appropriately  to  indicate  the  interaction  be- 

ij  • iJ 

tween  the  joint  operations  of  detection  and  estimation. 

The  corresponding  coupled  risk  is  given  by 


R = 


ft 


cij<VVf<VHV.Zk)dxkdz" 


(4.2) 


Here  A is  the  observation  space  and  A^  is  the  region  in  which  the  decision  nk 

V M 

is  true  so  that  A - A^,  ft  is  the  signal  space,  and  Zk  refers  to  all  the  past 

sensor  returns  accumulated  up  to  stage  k. 

The  optimal  decision  is  then  to  decide  in  favor  of  hypothesis  r|j[  when 


the  following  inequality 
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^ O Cji(xk,xk)f  (xk’Hk,Zk)d\  < I3  J . CU(xk,Xk)’f(xk,Hk,zk)dxk  (4'.3) 

l-i  ^ i=l 


is  satisfied  for  all  14 j ; j,£.=  l,2,3. 

The  estimator  that  minimizes  the  coupled  risk  of  Eq.  (4*2)  is  defined 
as  the  optimal  estimator  under  multi-sensor,  uncertain  environments.  The 
corresponding  optimal  detector  is  given  by  Eq.  (4.3).  Again  it  is  not  feasible 
to  implement  the  optimal  estimator  and  the  optimal  detector,  because  of  the 
need  for  evergrowing  memories.  Therefore  the  suboptimal  alternative  must 
be  resorted  to. 

4.3  Suboptimal  Estimator  and  Detector 

In  this  section,  a suboptimal  estimator  with  finite  memory  require- 
ments for  the  cost  function  of  Eq.  (4.1)  is  developed.  It  is  understood  that 

1 2 

if  the  decision  is  that  either  or  is  true,  there  is  no  updating  in 

the  estimate.  Hence,  there  is  no  cost  incurred  for  estimation  when  either 

12  3 

°r  decided.  In  Eq..  (4.1)  x^  appears  only  when,  in  fact,  Is  true 

(j=3). 


The  cost  associated  with  the  joint  operations  is  described  by  the 
following  equations, 


and 


cll(xk*xk> 

= 

Lll’ 

C12(W 

= 

L12’ 

Cl3(xk,Xk) 

S3 

L13+F 

= 

L21’ 

C22(\’Xk) 

t= 

L22 

C23(VV 

2 

L23+F 

C31(xk,Xk) 

— 

L31+F 

C32(Vxk) 

=5 

L32+F 

C33(VXk) 

2 

L33+F 

T 

,x  x. 
3 k k 

aTa 

LXkxk 

aT  a 


k k 
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The  estimate  obtained  by  minimizing  Eq.  (4.2)  with  respect  to 

x^.  This  is  equivalent  to  minimizing 


f } f 

J-i 


C3j(Sk*Xk)f(xk'Hk’Zk)dxkdZk 


(4.4) 


with  respect  to  x^.  Since  the  cost  C_(xk>xk)  °f  quadratic  form,  the 
minimization  of  Eq.  (4)  yields  the  weighted  conditional-  mean  estimate 


where 
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vf<*kK  ’ Hk-<)dvf„<“k  ■ Hkizk> 


xk  ~ 

J/3ifw<“k  ‘ Kk|Zk> 

I ...I  F33Sk<V£XlZk> 

“k-l  Q1 

(4.5) 

£ 1...  ^ F3iVaJV 

1-1  “k-l  “l 

(4.6) 

\(V  = 

(4.7) 

f w * *J  V “ f w ( ak=Hk * ak- 1 ai  1 V ’ 

(4.8) 

and 


ak  = (Ctk,0tk-l ai)- 


The  corresponding  suboptimal  estimator  of  finite  memory  which  uses  all  the 
observations  from  the  most  recent  N stages  is  given  as  follows 

vs>k)f>k|Z") 


v*  _ “k-l  “k-N+l 


I'  l I F31£^Cbkl<) 


i=l  a,  ' a. 

k-l  k-N+1 


(4,9) 


where  - *k<VHk’Vl °k-W* 

" fw(tVHk,ak-l ak-N+l  I Zk}  ’ 


I 


39 


bk  “ .(ak,ak-l,,,'.’VN+l)‘ 


It  is  to  be  noted  that  the  set  of  necessary  equations  for  updating 
the  estimate,  the  variance,  and  the  weighting  coefficients  is  identical 
to  those  derived  in  Chapter  3. 

The  decision  rule  is. postulated  in  Eq.  (4.3),  and  the  observation 
space  A is  partitioned  into  three  disjoint  regions. 

Region  A^  in  which  r|3  is  true  is  true  is  bounded  by  the  following 
inequalities. 


Cli(Xk*Xk)f(xk*Hk’Zk)dXk 
< J.  h Cii(xk,xk)£ (xk’“k,Zk 


ft  C£.i^Xk’ Xk^ £ ^Xk,dk’ Zk^dxk’  £ ~ 2,3 


(4.10) 


In  order  to  evaluate  these  inequalities  one  has  to  substitute  the  cost  de- 
fined in  the  beginning  of  this  section  into  Eq.  ( 4.10)  and  recognize  the  de- 
finition of  the  covariance  matrix 


XkXkf  (xk ! °k  Hk ’ “k-l dk-N+l  ’ V dxk 


Vk(bk>  + xk(bk),xk(bk)T 


(4.11) 


L V»+i,zk)dx» 


TX<bk>  + \<bk)T-Sl(bk) 


(4.12) 


where  v^(hk)=vlc(ak=H^»ctk  l’ * * ,otk-N+l^  ’ and  TrVk^bk^  denotes  the  trace  of 


the  matrix  V^Cb^) . 


Def ine 


A-  l-  l Kl-b21>-£i<bklZk>  + <L12-L22)t«<bklZJ) 

ot.  , a k 


k-l  k-N+1 


+ «-i3-l-23)^(bt|z")  + <Fu-F23)(Tr*J(bk)  ♦ SjX^SjX)) 
•£>k|F^)}.  (‘■•13) 

B'  1...1  {au-L31)fi<bk|z”)  + (L12-L32)f^(bk|zjJ) 

0lk-l0lk-N+l 

* aU-L33)fw<b1X>  * (F13-F33)  (TrVk<bk>  + Sk<bk>T\<bk>)- 

'X^  - tF13£i(bklZk)  + F32fX'Zk>  + F33fv(bklZk»- 


<Sk)T'<Sk)  + F33'(Sk(bk,TSk  + <i!k)T\<bk,)'fw(bklZk)}’ 


and 


(A. 14) 


C = B - A. 


(4 . 15) 


It  is  noted,  from  che  definition  of  cost  incurred  for  joint  detection 

and  estimation,  that  the  estimate  x,  in  C..(x,  ,x,  ) should  be  essentially 

k 13  k k J 

replaced  by  x^  of  Eq.  (4.9). 

Upon  substituting  Eqs.  (4.13)  and  (4.14)  into  Eq.  (4.31,  one  can  deter 
mine  the  boundary  of  region  A^ . It  can  easily  be  shown  that  the  region  A^ 
is  bounded  by  A<0  and  B<0.  It  similarly  follows  from  Eq.  (4.3)  that  the  region 
A^  is  bounded  by  A>0  and  C<0,  and  the  region  A^  is  bounded  by  B^O  and  OO  • 

It  is  noted  that  tjie  decision  boundaries  determined  by  Eqs.  (A. 13)  - 
(4.15)  are  equivalent  to  the  boundaries  determined  by  setting  up  the  uodifieJ 

generalized  likelihood  ratios  [15]. 

The  procedure  to  generate  the  optimal  decision  is  summarized  in 
the  following: 

Step  1.  Evaluate  A and  B,  respectively  using  Eqs.  (,4.13)  ana  (4.14) 
Step  2.  Validate  one  of  the  following  three  statements: 

(i)  A<0  and  B<0,  decide  r)^ 

n 

(ii)  A>B  and  A>0,  decide 
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(iii)  B>A  and  B>0,  decide 

Step  3.  Calculate  the  optimal  estimate 

1 2 

(i)  Either  nk  or  is  true,  estimate  is  not  updated; 

V*k|k-1 

3 

(ii)  nk  is  true,  estimate  is  updated;  x^=x^. 

This  scheme  will  now  be  compared  to  the  PSE  scheme  introduced  in 
Chapter  3 . The  coupled  risk  formulation  is  not  particularly  meaningful  to 
the  PSE  , since  the  estimate  is  updated  no  matter  what  hypothesis  is  true. 
Thus,  rather  than  comparing  the  two  schemes  in  terms  of  the  coupled  cost, 
it  would  appear  more  appropriate  to  compare  the  two  in  terms  of  the  per- 
centage of  wrong  detection  and  the  variance  of  the  estimator. 

4.4  Simulation  Results 

The  message  model  and  the  observation  model  chosen  for  joint  detec- 
tion and  estimation  are  the  same  as  those  of  Chapter  3.  The  values  of 
matrices  L and  F in  Eq.  (4.1)  which  are  assigned  to  the  cost  of  maxing  decision 

and  the  cost  of  making  estimation  respectively,  are 


Oil 

0 0 1 

10  1 

F » 

0 0 1 

110 

111 

for  the  suboptimal  estimator  and  detector  of  finite  memory.  The  values  of 
matrices  L and  F also  indicate  the  interaction  of  coupling  of  the  joint 
operations,  which  is  weighted  equally  in  this  case. 

A one-hundred-step  process  was  simulated  and  ten  runs  were  averaged 
to  obtain  the  results  such  as  variance  and  percentage  of  wrong  detection 
under  five  ground  sensors.  It  is  assumed  that  the  a priori  probabilities 
in  the  case  of.  independent  hypotheses  and  the  transition  matrix  in  the 
case  of  markov-dependent  hypotheses  are  known  to  the  receiver. 
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TABLE  4.1 


3 

Variances  of  PSE  and  SEE;  POl^)  = 1* 


STAGE 

PSE 

SFE 

10 

.00056 

.00056 

20 

.00030 

.00030 

30 

.00021 

.00021 

40 

.00016 

.00016 

50 

.00013 

.00013 

60 

.00011 

.00011 

70 

.00009 

.00009 

80 

.00008 

.00008 

90 

.00007 

.00007 

100 

.00007 

.00007 

TABLE  4 • 2a 

Variances  of  PSE  and  SFEi  P(H*)=P(H^)=.l  and  P(H^) 

K K K 


STAGE 

PSE 

SFE 

10 

.00083 

.00085 

20 

.00041 

.00043 

30 

.00027 

.00029 

40 

.00021 

.00021 

50 

.00016 

.00017 

60 

.00013 

.00014 

70 

.00012 

.00012 

80 

.00010 

.00011 

90 

.00009 

.00009 

100 

.00008 

.00008 
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TABLE  4.2b 


Z Wrong  Detection  of  PSE  and  SEE;  P(h£)-P («£)*. 1 and  P(Hj|) 


STAGE 

PSE 

SFE 

10 

22 

20 

20 

26 

20 

30 

14 

.18 

40 

24 

16 

50 

22 

14 

60 

10 

16 

70 

42 

14 

80 

10 

24 

90 

16 

14 

100 

R 

32 

TABLE  4.3a 

Variances  of  PSE  and  SFE;  P(HjS  = .2  and  P (H^)  =P (H^)  = . 4 

K K K. 


STAGE 

PSE 

SFE 

10 

.00239 

■:  19106 

20 

.00109 

.03719 

30 

.00065 

.00131 

40 

.00049 

.00071 

50 

.00035 

.00054 

60 

.00029 

.00043 

70 

.00026 

.00036 

80 

.00023 

.00030 

90 

.00021 

.00026 

100 

.00019 

.00022 
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TABLE  4.3b 

X Wrong  Detection  of  PSE  and  SFE;  P(H*)=.2  and  P(1^)=P(H^)=. 4 

K K K 


STAGE 

PSE 

SFE 

10 

46 

60 

20 

50 

64 

30 

46 

56 

40 

48 

50 

50 

38 

58 

60 

34 

56 

70 

36 

56 

80 

32 

64 

90 

40 

50 

100 

40 

54 

TABLE  4.4a 


Variances  of  PSE  and  SFE;  Markov-dependent  Hypotheses 


STAGE  PSE  SFE 


10  .00385  3.48371 

20  .00090  5.25671 

30  .00102  5.17088 

40  .00058  4.11383 

50  .00055  3.63712 

60  .00041  2.86978 

70  .00030  2.31418 

80  .00025  1.46436 

90  .00023  1 02339 

100  .00021  ; 70642 


i 


TAbuE  4.4b 


Z Wrong  Detection  of  PSE  and  SFE,  Markov-dependent  Hypotheses 


STAGE 

PSE 

SFE 

10 

48 

64 

20 

36 

74 

30 

36 

74 

40 

26 

66 

50 

30 

62 

60 

• 38 

58 

70 

32 

62 

80 

38 

56 

90 

42 

64 

100 

48 

66 

TABLE  4.5 

Overall  % Wrong  Detection  of  PSE  and  SFE 


CASE 

PSE 

SFE 

P(H^)  - 1 

0. 

0. 

p(h£)  = P(H^)  - 

• 1 *P (H^)  = -8 

18.16 

21.36 

P(hJ)  = .2,P(H*) 

= P(Hj^)  =■  .4 

42.30 

57.90 

Markov-dependent 

35.18 

63.48 
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The  variances  shown  in  the  figures  of  this  chapter  are  the  variances 
associated  with  the  first  component  of  the  state  vector. 

For  the  case  of  P(H^)=1,  both  schemes  perform  equally  well  and  have 

zero  percentage  of  wrong  detection.  The  averaged  variances  at  particular 
stages  are  shown  in  Tables  4.1,  4.2a,  and  4.1a  f°r  t^'e  case  of  independent 
hypotheses.  As  expected  the  variance  of  PSE  is  lower  than  that  of  SFE. 

The  improvement  of  performance  of  PSE  over  that  of  SFE  becomes  pronounced 
in  the  markov-depcndent  hypotheses  case  shown  in  Table  4.4a.  Ihe  averaged 
percentage  of  wrong  detection  at  particular  stages  are  shown  in  Tables 
4.2b,  and  4.1h  for  the  case  of  independent  hypotheses  and  Tahle  4.4h  t 
for  markov-dependent  hypotheses.  The  overall  averages  of  percentage  of 
wrong  detection  are  shown  in  Table  4.5.  It  is  expected  that  the  percentage 
of  wrong  detection  for  PSE  is  lower  than  that  of  SFE,  and  it  is  approximate- 
ly fifty  percent  lower  than  SFE  for  the  case  in  which  the  hypotheses  switch 
in  markov  manner. 

The  structure  of  SFE  developed  in  this  chapter  is  very  complicated 
in  the  sense  that  it  takes  a larger  amount  of  computational  requirements 
such  as  execution  time  and  core  locations  as  compared  to  those  taken  by 
the  PSE.  The  simulation  results  clearly  show  the  superiority  of  the  PSE 
as  compared  to  the  SFE  designed  using  the  coupled  risk  formulation. 
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5.  TRACKING  OVER  FADING  CHANNELS 


A problem  encountered  in  many  applications  is  that  of 
distortion  of  the  transmitted  signal  by  the  communication  channel. 

The  distortion  introduced  by  the  channel  can  be  characterized  by 
multiplicative  disturbances  which  are  responsible  for  such  phenomena 
as  fading,  dispersion  and  multipath  [1],  As  can  be  expected,  the 
presence  of  such  phenomena  complicates  the  problem  of  target 
tracking  in  a multisensor  environment.  Explicit  algorithms  for 
the  probabilistic  estimator  for  this  problem  have  been  derived  and 
its  performance  evaluated  by  simulations  which  confirm  the  advantages 
probabilistic  estimator.  Details  of  the  algorithms  and  the  simulation 
results  are  given  in  [2]  which  has  been  included  as  Appendix  B of 
this  report. 


REFERENCES 


[1]  H.  L.  Van  Trees,  "Detection,  Estimation  and  Modulation  Theory", 
John  Wiley  and  Sons,  Inc.,  New  York,  1971. 

[2]  F.  S.  Chang  and  M.  D.  Srinath,  "Target  Tracking  Over  Fading 
Channels,"  IEEE  Trans.  Communications,  Vol.  COM-24,  No.  4, 
April  1976,  pp.  432-438. 


49 


6.  ESTIMATION  USING  QUANTIZED  MEASUREMENTS 
6.1  Introduction 

In  many  practical  situations,  the  observations  are  presented 
to  the  receiver  in  quantized  form.  In  general .measurements  will  be 
quantized  if  they  originate  from  digital  sensors,  in  digital  trans- 
mission systems  or  when  processed  by  digital  computers.  In  this  chapter 
we  will  derive  algorithms  for  the  problem  of  tracking  a target  in  a 
multi-target,  multi-sensor  environment,  when  the  observations  are  in 
quantized  form,  using  the  probabilistic  estimator  approach. 

It  is  well  known  |1 | that  the  problem  of  estimation  with  quantized 
measurements  can  be  considered  to  be  equivalent  to  the  estimation  of 
signals  in  the  presence  of  additive  white  noise. 

For  a measurement  equation  of  the  form 

z **  h(x,  v)  (6.1) 

the  conditional  mean  (minimum  mean  square  estimate)  of  any  function  f(x) 
given  quantized  measurements  zcA  where  A is  some  region  (e.g.  a hypercube) 
can  be  written  as 

» 

E{f(x)  |zeA}  - E{E[f(x)  |z]  |zeA}  (6.2) 

Thus  the  required  conditional  expection  is  obtained  in  two  steps:  (1) 

find  E(f(x) |z) : this  is  the  usual  goal  of  estimation  with  unquantized 

measurements;  (2)  find  the  expectation  of  E(f(x)|z)  conditioned  on  zcA. 

For  the  case  when 

x ■ N(x,  M) 
v - N(0,  R) 

z " Hx  + v 


E(xvT) 


(6.3) 
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where  N(*,*)  denotes  a normal  distribution  whose  mean  and  covariance 
are  the  first  and  second  arguments,  respectively,  x is  an  n-component 
vector,  z and  v are  m-component  vectors,  and  H is  an  ra  x n matrix, 
the  following  expressions  for  the  conditional  mean  and  covariance 
are  obtained  [ 1] 


E (x | zcA)  = x + K [E(z | zcA)  - Hx] 

T 

cov  (x | zeA)  = P + K cov(zjzcA)  K 


where  K is  the  minimum  variance  gain  matrix,  and  P is  the  covariance 
of  x that  would  be  obtained  had  the  measurements  not  been  quantized. 
These  results  show  that  quantization  increases  the  error  variance  of 
the  estimate  as  though  it  was  noise  added  after  the  measurements. 

Suppose  a number  of  measurements  are  made  at  different  times  t^, 
i - 1,  2 ...  . Since  the  measurements  at  different  times  are  correlated, 
the  conditional  density  of  z given  the  sequence  of  past  measurements 
will  change  as  more  measurements  are  taken.  Thus,  even  though  the  equa- 
tions may  be  solved  recursively,  E(z|zeA)  must  be  recomputed  after  each 
measurement;  this  necessitates  a complete  solution  of  the  equation  after 
each  measurement  is  taken.  We  can,  however,  obtain  an  approximate  re- 
cursive estimate  by  assuming  that  the  conditional  distribution  of  the 
state  just  prior  to  the  iC^  measurement  is  Gaussian  with  mean  and  vari- 
ance given  by  Eq.  (6.4).  For  example,  let  us  consider  the  linear  message 
and  observation  models 
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r 


^ — - 


= *k  xk + rk  wk 


yk  = ck  \ 


zk  - yk  + \ 


where  is  the  n x 1 state  vector,  and  is  a p x 1 output  vector. 
The  m-vector  is  white  zero  mean  gaussian  with  covariance 


E{Ye}>  Vke- 


Let  us  assume  that  the  observations  are  quantized  so  that  the  estimate 
of  the  state  x^  must  be  based  on  the  information  that 


b - b * b 


J * 1,  2 ...  k 


It  can  then  be  shown  [2,3]  that  the  approximate  algorithms  for  esti- 
mating x^  for  the  case  when  is  diagonal  are  given  by 

*k  " Xk|k-1  + Wk 

A A 

*k|k-l  “ ^k  Vl 

A ^ 

where  dfc  represents  a vector  whose  i component  is 


[f«Vi>  - xq-vV1 


toJi  • ISkb'Vkb 


•‘Vi 

(“tb  " l6kb!Vkb 

J ■’kb 

where  f(*)  denotes  the  density  function  of  the  i component  of  v^» 
is  a diagonal  matrix  whose  elements  are 

rn  ,-n  IT*  . - , l'>kbf([\b)-[ukiif([ukb)  " 
L<Dk>Jii-  [dkb[dkb 71^7 

(Rkbi  , , f([vkb)d[vkb  - . 

‘Sb  kk-«vi 


V*k|k-i 


f 


The  filter  gain  and  error  variance  equations  are  given  by 


\ • vi  fWk  * 

■V  ‘ Mk-i 
vk  • *k  11  - \-ick-i>vk-i^  + Wk 
We  now  derive  the  probabilistic  estimator  for  the  problem  of 
target  tracking  described  earlier.  Thus,  the  observation  model  for  any 
sensor,  corresponding  to  the  three  hypotheses  namely  (i)  noise  alone 
(e.g.,  cut-off  communication  link),  (ii)  false  alarm  (e.g.  thermal  or 
process  noise,  or  clutter  from  a vehicle  not  being  tracked),  and  (iii) 
the  vehicle  being  tracked,  is  given  as  follows: 

Hk  : Zk  = Uk 
Hv  : Zk  Ck  x(kfk_1)  + vi. 


(6.11) 


(6.12) 


k 

Hk  : Zk  = Ck  \ + u’ 


k ~k  “k  ' “k 

6.2  Probabilistically  Supervised  Estimation  with  Quantized  Measurements 
We  use  the  structure  of  the  probabilistic  estimator  de- 
rived in  Chapter  3,  in  which  a set  of  parallel  processing  units 

(PPU's)  is  used  to  process  the  sensor  returns.  On  receiving 

the  quantized  observations,  each  PPU  performs  a random  labelling  test 

to  generate  a label  corresponding  to  each  of  the  three  hypotheses. 

The  label  is  than  accepted  as  the  true  hypothesis  at  that  stage.  Depend- 
ing on  the  label,  the  observation  from  the  iC^  sensor  is  modeled  accord- 
ing to  Eq.  (6.13)  by  the  corresponding  PPU.  The  algorithm  for  com- 
bining the  observation  models  supplied  to  obtain  the  track  estimate  is 
discussed  later. 

We  can  now  consider  the  generation  of  the  random  labels. 

Suppose  there  are  M sensors  and  that  at  stage  k,  there  are  n^  returns, 

_<  M.  Let  Jt*  denote  the  label  generated  by  the  i PPU  at  stage  k 


(6.13a) 

(6.13b) 

(6.13c) 
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and  let  be  the  vector 

>*  -!*£*£••  • Ck]T 

H 

Let  L.  denote  the  set  of  all  past  labels  L , j c 1 • • • k. 

* J 

Similarly  let  denote  the  itb  sensor  return  at  stage  k and 


T T 
r 1 2 

[zk  zk 


n T 

k ,T 
zk  1 


Since  the  observations  are  quantized,  we  can  write 

.i  i ri  / c. 

b.  < z.  < b, 

-k  - k k 

Let  B denote  the  hypercube  in  n -dimensional  space  corresponding  to  the 

iC  & 

set  of  observations  at  stage  k.  Then 

(6.: 

Zk  eBk 

Let  denote  the  set  of  all  past  observations  , j = 1,  ...  k and 
let  correspond  to  the  quantized  observations.  Since  there  are  three 

hypotheses,  the  label  £,*  is  a ternary  random  variable.  When  a new  meas- 
urement z£+^  is  received,  the  PPU  updates  the  probabilities  associated 
with  the  three  outcomes  on  the  basis  of  the  past  measurement  se- 

quence  and  the  past  labelling  sequence  L^.  We  can  then  write  for 
these  probabilities 

f(*k+l  “ ^k+l 1 -k+1  - Zk+1  < bk+l*  ^ e Bk*  V 

_ wi  i ri  | . i _M  „M  T M.  , , . i ,Ji_MTM. 

_ Prob(-k+l-Zk+l<bk+l 1 g’k+l"Hk+l,ZkeBk>Lk)f (fck+l"Hk»l ^ ZkeBk,Lk) 

B V/Li  i ri  i „M  „M  TM, 

Prob(bk+i  < zk+1  < bk+1|Zk  e Bk,  1^) 


_ , » , i i ^ ri  i_M  „M  tM. 

Prob(bk+l  - *k+l  < bk+llZk  E V Lk) 


1 


J-l 


f (fck+l  “ Hk+]JZk.  C Bk’  Lk) 


(6.19) 


for  i ■ 1,  2 ...  n^;  J ■ 1,  2,  3. 

If  we  assume  that  the  hypotheses  are  independent  from  stage  to  stage, 
we  can  write 

«*£«  - e <•  # ■ f(£i  ■ Hi+i>  (6-20) 

k i i 1 M M M 

It  only  remains  to  determine  f ( I Zj^  e B{t»  • Let 

* m 

Xfc+i  |k^k^  ^enote  c^e  conditional  expectation  of  x^^  conditioned  on  the 
sequences  of  past  observations  and  labels.  It  then  follows  from  the 
measurement  models  of  Eq.  (6.13)  that 

f<vii4i  ■ «Li-  < e <■  ■£>  ■ y°-  Vi1  for  J • 1 


" MpICk+A+llkak'  ,!!k+l<Lk)1 


(6.20a) 

(6.20b) 

(6.20c) 


for  j*2 

> " NpICk+l^k+l | k(LkJ ’ Ck+lVk+l | k(Lk) Ck+l4Rk+l 3 

for  j®3 

A 

It  only  remains  to  evaluate  x^^  k^k^*  Unf°rtunately*  as  discussed 
earlier,  the  computation  of  this  estimate  is  not  easy  even  though  the 
signal  and  message  models  are  linear,  because  of  the  quantized  nature 
of  the  observations.  We  can,  however,  use  the  approximate  algorithms 

A if 

of  Sec.  2 to  obtain  the  estimate  x^-n  k^k^  * restrict  ourselves 

to  the  case  where  is  diagonal.  The  extension  to  the  case  of  nondiag- 

M 

onal  R^  is  straight  forward.  Let  denote  the  matrix 

f /pi  \^|  trp-  j (r&  ) (6.21) 

l(Ck+l)  ' (CV+1 } J 


4* 
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and  q*  and  denote  the  vectors 


i v*  i“ 

qk  " -k  ~ CkXk|k+l 

i -i  i“ 

“k  “ bk  ~ ck*k|k+l 

Let  d*  denote  the  vector  whose  j tb  component  is  given  by 


(6 

(6 


i fdqi].)  - fdujh) 


(6 


lujlj 

i f([vkVd[vk]1 
- J J 

where  []j  denotes  the  jtb  component  of  a vector  and  f(*)  is  the  (normal) 

density  function  of  the  jtb  component  of  v^.  Let  D*  denote  the  diagonal 

matrix  whose  elements  are  given  by 


' i,  ,,i, 


"i'jj 


f([vk]  )4I»kl  J 

t’Jlj 


(< 


Let 


and 


,M  f ,,1.T'  . ,2.Ti  i 

dk  * (<V  : (dkJ  ' * * ‘ (dk  * J 


_M 

Dk" 


'I  n*l 


Then  the  algorithms  for  determining  *^+1 1 are  given  by 


(f 


(6 


.22a) 

.22b) 

.23) 


>.24) 

.25) 

.26) 


l 
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Xk+l|k  " Vk 

A A MM 

Xk+1  " xk+l |k  + Kk+lDk+ldk+l 
where  the  filter  gain  equation  is 


' v: 


+ Dk+i> 


(6.27) 


The  error  variance  equation  is 


vk+i|k‘Wk  + rA  <6 

Vi  ■ 11  - WtiiViik  16 

Equations  (6. 18)-(6. 31)  constitute  a complete  set  of  approximate  algorithms 
for  estimating  the  state  of  a target  in  multisensor  environments  when 
the  observations  are  in  quantized  form. 

6.3  Simulation 

In  order  to  determine  the  performance  of  the  probabilistically 
supervised  estimator  with  quantized  measurements,  the  same  tracking  ex- 
ample considered  in  earlier  chapters  was  simulated.  The  system  parameters 
are  given  as  follows 

- x position  at  stage  k 
~ x speed  at  stage  k 


x3  - y position  at  stage  k 
x^  - y speed  at  stage  k 


- speed  change  between  stage  k and  k 

- speed  change  between  stage  k and  k 


r 

i 
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where 


1 

0 

0 

0 


T 

1 

0 

0 


0 

0 

1 

0 


0 

0 

T 

1 


. Q,. 


qu00  o 


r is  the  sampling  period. 


q2200 


Ck- 


10  0 0 
0 0 10 


r11(k)  0 


r22(k) 


In  the  simulation,  the  following  values  are  used 
T ■ 2 sec. 

p • 0 

rtjp  - 0.01 


x , multivariate  normally  distributed  with  dimension  four, 
o 

Na  (0,  0.01) 

Qfc  - diag  [0,0] 

- diag  [0.00828,  0.00828] 

Quantizer  interval  A * 2 ^ 

- 0.03125 
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1-5,  number  of  sensors 
Prob(H^)  - Prob(H^)  - 0.1 
Prob(H^)  - 0.8 

A 40  stage  process  was  simulated,  and  80  runs  were  averaged  to  obtain 
the  meai  square  error.  The  mean  square  error  In  the  x and  y positions 
for  the  first  10  stages  are  shown  in  Fig.  1.  Table  1 shows  a comparison 
of  the  mean  square  error  obtained  using  the  probabilistic  estimator  as 
given  in  [8]  with  the  results  obtained  with  quantized  measurements. 

As  can  be  expected,  the  error  is  somewhat  larger  with  quantized 
measurement  s . 
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TABLE  I 

MSE  Comparison  of  PSE(Q),  PSE 
P(h£)  - P(h£>  - 0.1,  P(H^)  - 0.8 


STAGE 

PSE(Q) 

PSE[8] 

10 

.02223 

.02071 

20 

.01122 

.00283 

30 

.00179 

.00159 

40 

.00087 

.00065 

60 

i«— 


7.  CONCLUSIONS 


This  research  has  considered  the  problem  of  joint  detection  and 
estimation  of  signals.  A specific  problem  of  tracking  a target  in  a 
multi-target  multi-sensor  environment  was  investigated.  This  problem 
is  somewhat  typical  of  the  type  of  problems  encountered  in  this  area. 

In  order  to  overcome  the  exploding  computational  requirements  of  the 
problem,  a suboptimal  scheme  which  uses  a probabilistic  judgement  to 
eliminate  extraneous  returns  was  derived.  Explicit  algorithms  for 
tracking  the  target  have  been  derived.  The  scheme  was  compared  with 
two  other  suboptimal  schemes  in  terms  of  performance  and  computational 
requirements.  The  tracking  algorithms  were  extended  to  include  cases 
in  which  the  observations  are  received  over  fading  media  or  in  quantized 
form.  Simulation  results  indicate  that  the  performance  of  the  scheme 
is  acceptable  under  these  conditions. 
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APPENDIX  B 


Target  Tracking  over  Fading  Channels 
F S,  CHANG  and  M D.  SRINATH 

Abstract  The  problem  of  tricking  a target  in  a multitarget  environ- 
ment when  the  observations  are  received  over  a fading  channel  is 
considered.  The  optimal  Bayes  solution  to  the  tracking  problem  in  such 
cases  involves  growing  memory  and  hence  is  not  feasible.  A particularly 
effective  suboptimal  scheme  uses  a probabilistic  judgment  at  each  stage 
of  the  observations  to  overcome  this  problem.  This  concise  paper 
presents  an  evaluation  of  the  scheme  in  terms  of  mean-square  error 
performance  when  the  observations  are  received  over  fading  channels. 


I.  INTRODUCTION 

Recently  there  has  been  an  increasing  interest  in  the  syn- 
thesis of  communication  systems  for  space  applications.  The 
communication  channel  modifies  the  transmitted  signal  so  that 
the  desired  message  may  arrive  at  the  receiver  terminal  dis- 
torted, attenuated,  and  delayed  The  message  is  usually  ob- 
served in  the  presence  of  additive  noise.  The  distortion  intro- 
duced by  the  channel  can  be  characterized  by  multiplicative 
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tlisturhanccs  which  are  responsible  for  such  phenomena  as 
fading.  dispersion,  mullipath.  phase  distortion,  and  time  delay. 

In  recent  years  considerable  effort  has  been  directed  toward 
the  application  of  stochastic  estimation  theory  to  the  problem 
o',  demodulation  of  modulated  signals  observed  in  noise  | I)- 
(4)  Snyder  1 5]  has  presented  a state-variable  model  for  a 
wide  variety  of  analog  communication  systems  and  channels 
for  a class  of  Markov  processes  He  has  also  presented  an 
analog  model  for  the  Rayleigh  fading  channel  in  terms  of  two 
quadrature  multiplicative  noise  processes  which  depend  upon 
the  frequency  of  the  transmitted  signal.  Stochastic  message 
and  channel  disturbance  piocesscs  were  formulated  as  com- 
ponents of  the  message  model  for  mathematical  convenience. 
The  effect  of  the  modulation  process,  multiplicative  noise,  and 
additive  noise  was  considered  in  the  observation  model.  The 
message  model  was  augmented  to  include  the  model  generating 
the  multiplicative  disturbances.  A summary  of  results  in  this 
area  pertaining  to  angle  modulation  is  given  in  ( 6 J . 

This  concise  paper  presents  a scheme  for  tracking  a target 
in  mulhtarpet  multisensor  environments  when  the  observations 
presented  to  the  receiver  are  received  over  such  channels  The 
dynamic  equation  of  motion  of  the  target  being  tracked  is 
assumed  to  be  modeled  by  a set  of  Gauss-Markov  difference 
equations 

+ l 

>*  = (I) 

where  ar*  rs  the  it  X I state  vector  and  y*  is  a p X ) output 
(message)  vector.  The  m-vector  co*  is  white  zero-mean  Gaus- 
sian with  covariance  is  an  it  X it  state 

transition  matrix,  and  Ph  is  an  n X ttt  excitation  matrix. 

If  a particular  sensor  is  not  locked  onto  the  target  being 
tracked,  the  corresponding  sensor  return  at  any  stage  can 
originate  from  one  of  three  sources  (hypotheses),  namely,  1) 
noise  alone  (e  g.,  cutoff  communication  link),  2)  false  alarm 
(eg  , thermal  or  process  noise,  or  clutter  from  a vehicle  not 
being  tracked),  and  3)  the  vehicle  being  tracked  The  receiver 
performs  under  uncertainty  of  reception  in  a ternary  case 
which  considers  all  the  possibilities  A suitable  model  for  the 
observations  under  each  hypothesis  can  then  be  formulated  as 
follows  1 71 . 

Under  //’ , the  observation  zk  is  given  by 

W*1  z*  = uk  (2) 

where  the  p X I vector  uh  is  a zero-mean  white  Gaussian  se- 
quence with  covariance  £{u*um}  = Rk^km  Under  hypothesis 
H2 , the  current  observation  is  not  used  in  tiack  updating  and 
the  one-stage  prediction  is  used  instead  The  effect  of  the 
extraneous  returns  can  be  modeled  as  a white  zero-mean  Gaus- 
sian sequence  iq,  with  covariance  £'{iq,um}  = It  is 

therefore  reasonable  to  model  the  observation  zh  under  hypo- 
thesis II2  as 

H„2  ■ z„  = lt\x(k  |k  - l),*l  + uk  (3) 

where  x(k  \ k - I ) is  the  one-stage  prediction  of  jr* . 

The  expression  for  the  covariance  kernel  fZ*  is  given  in  (9) , 
and  for  p = 2 simplifies  to 

= — (4) 

1 + - • nTF  • \/l  — pJ 

where  *£{(*»  - >*,*_,  X**  - Jr*,*-,  )T}.  nTF  is 

the  expected  number  of  incorrect  returns  in  one  sigma  region 
<7*  = nun  {(C*  C*T  + /?*)(<}l/l,  and  p is  the  correla- 


tion between  two  measurements,  and  where 


The  noise  sequences  u>t:.  uh.  and  vh  are  mutually  indepen- 
dent and  arc  uncorrelated  with  respect  to  the  initial  state  x0, 
which  is  multivariate  normally  distributed  of  dimension  n with 
mean  p0  and  covariance  l'0.  .Vn|p0- I • 

Under  II3,  die  observation  corresponds  to  the  vehicle  being 
tracked,  and  hence  we  have 

Mu3  zh  = k\x,„k\  + uk  (6) 

It  is  well  known  that  the  optimal  Bayes  estimator  for  this 
problem  involves  an  ever-growing  memory  and  is  hence' not 
feasible.  This  is  because  the  estimate  of  or*  at  each  stage 
depends  on  the  sequence  of  past  true  hypotheses  This  se- 
quence, however,  is  unknowm,  and  hence  the  estimate  must  be 
averaged  over  all  possible  past  sequences  In  the  pattern  recog- 
nition context,  this  is  usually  referred  to  as  unsupervised 
learning.  (The  situation  where  the  sequence  of  past  hypotheses 
is  known  corresponds  to  supervised  learning.) 

Let  Z), M denote  the  set  of  observations  up  to  stage  k,  and 
let  or),'  correspond  to  a particular  sequence  of  past  hypotheses 
at  stage  k.  We  can  then  write  for  the  estimate 

• xk  = E[x„  | Z„M) 

N k 

= £ E{xh  \Z„")  (7) 

i~\ 

where  Ni,(  = 3h ) represents  the  total  number  of  possible  se- 
quences at  stage  k.  It  is  clear  that  as  the  number  of  stages 
increases,  the  total  number  of  sequences  over  which  the  esti- 
mates must  be  averaged  increases  very  rapidly  We  therefore 
need  to  resort  to  suboptimal  schemes  which  use  a fixed 
memory  There  are  basically  two  approaches  which  can  be 
taken  to  solve  this  problem.  We  can  use  a fixed  number,  say  K, 
of  past  observations  in  arriving  at  our  estimate  Alternately, 
we  can  use  all  past  observations  in  our  estimation  scheme,  but 
choose  only  one  (or  more  generally,  a fixed  number  M)  of 
possible  past  sequences  ovei  which  to  average  A few  such 
schemes  have  been  discussed  in  |7)-[9) 

In  this  concise  paper,  we  use  a p-obabiiistic  scheme  to 
decide  at  each  stage  which  one  of  the  sequences  a*1  a*** 

corresponds  to  the  true  sequence.  The  effectiveness  of  this 
probabilistically  supervised  estimator  (PSE)  is  investigated  in 
terms  of  mean-square  error  and  variance  when  the  observations 
are  received  over  fading  channels. 


II  PROBABILISTICALLY  SUPERVISED  ESTIMATOR 

In  generating  a probabilistically  supervised  learning  se- 
quence, one  has  to  design  a probabilistic  judgment  at  each 
stage  of  observations  on  the  sensor  returns  18) , 19) . This  judg- 
ment is  carried  out  in  parallel  on  each  sensor  return  in  such  a 
way  that  each  return  is  assigned  a classification  label  which 
then  serves  as  its  true  hypothesis  We  can  think  of  the  scheme 
as  consisting  of  a set  of  parallel  processing  units  (PPU),  one 
corresponding  to  each  of  the  sensors,  and  a central  processing 
unit  (CPU)  which  combines  the  processed  returns  from  the 
PPU’s  to  update  the  estimate  of  the  target  track.  On  receiving 
the  observations,  each  PPU  performs  a random  labeling  test  to 
generate  a label  /*  corresponding  to  each  of  the  three  hypo- 
theses [8).  The  label  is  then  accepted  as  the  true  hypothesis  at 
that  stage.  Depending  on  the  label,  the  observation  from  the 
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nh  sensor  is  modeled  according  to  (2)-(4)  by  the  corre- 
sponding PTU  The  CPU  algorithm  for  combining  the  ob- 
servation models  supplied  by  the  PPU's  to  obtain  the  track 
es.imatc  is  discussed  in  a later  section 

We  can  now  consider  the  generation  of  the  random  labels. 
Suppose  there  are  M sensors,  and  that  at  stage  k there  are  nh 
returns.  nk  < A#  Let  lh‘  denote  the  label  generated  by  the  ith 
PPU  at  stage  *.  and  let  /.<*)  be  the  vector 

/.(*>  = |/»%J  -/*"*  lT- 

Let  / k"  denote  the  set  of  all  past  labels  10),/  = I " * 

Similarly,  let  zk‘  denote  the  ith  sensor  return  at  stage  k, 
and  let 

/<*>  = U»,VT"V*TIT 

Let  ZkM  denote  the  set  of  all  past  observations  Z(j),j  = I k 
Since  there  are  three  hypotheses,  the  label  /*'  is  a ternary  ran- 
dom variable.  When  a new  measurement  a*  + j‘  is  received,  the 
PPU  updates  the  probabilities  associated  with  the  three  out- 
comes on  the  basis  of  zk  , the  past  measurement  sequence 
ZkM , and  the  past  labeling  sequence  LkM  We  can  then  write 
for  these  probabilities 
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Ilf.  AUGMENTED  MODELS 

Typically,  the  signal  nonlinearly  modulates  a sinusoidal 
carrier  that  arrives  at  the  receiver  distorted  by  disturbance 
processes  encountered  in  the  transmission  media  These 
disturbance  processes  usually  include  both  additive  noise  and 
multiplicative  noise.  They  interact  with  the  channel  inputs  in 
a randomly  time-varying  fashion.  In  particular,  the  multi- 
plicative Gaussian  channel  noise  sequences  ft*  and  ck  are 
assumed  to  be  generated  by  linear  time-invariant  models  de- 
scribed by  two  one-dimensional  first-order  discrete  stochastic 
equations: 

*4  + 1 = 0kb>>k  + |l  ~<t>i,b\mkb  + ukb  (14) 

and 

G,+  i =<t>keck  + II  + (15) 

where  u>kb  and  u>kb  are  statistically  independent  scalar  zero- 
mean  white  Gaussian  sequences  with  variances  qkb  and  qke, 
respectively,  and  mkb  and  mkc  represent  the  time-invariant 
means  for  the  multiplicative  sequences  bk  and  ck,  respectively. 
Furthermore, 


/l***l'l  Zk”.Lk") 


(8) 


where 

/<**♦,'  I Z*".L>") 

j=  i 

•/(/*♦!'  I Z„",LkM). 

for  i = 1 ,2  • ■ n* ,/  = 1 ,2,3  (9) 

If  we  assume  that  the  hypotheses  are  independent  from  stage 
to  stage,  we  can  write 

/»*♦  t'  = **  + i'  I Zk *'.!.*")  = /!/*♦,' = //*♦!').  00) 

It  only  remains  to  determine  f(zk  + j'  I /*  « j1  = //*  + ZkM , 

t-kM)  Unfortunately,  the  determination  of  this  density  func- 
tion is  extremely  difficult  because  of  the  nonlinear  nature  of 
the  observation  models  We  can  seek  to  approximate  this 
density  function  by  a Gaussian  density  which  has  the  same 
first  two  moments  We  then  obtain 

/<**  v i'  I /*♦  = H„  + J.ZkM.LkM ) = Np\0.Rh  + . 

for  / = I (II) 

= *p|fi'<i*ti  ,»(/.„»).*  + I ).«»♦, '(/.*«)!, 

for  /=  2 (12) 

= fVp|  h'(i*  + ,,»(/.„*).*+  I). 

for  / = 3 (13) 

where  Np (m.o2)  denotes  the  p-vanate  Gaussian  density  func- 
tion wuh  mean  w and  variance  a2,  and  AT*,  Uk,  and  Ck  arc  as 
discussed  earlier  The  superscript  i denotes  that  the  quantities 
correspond  to  ihe  ith  sensor. 


mkb  = 0,  for  Rayleigh  channel 

m kb  = 7,  for  Rician  channel 

and  y is  the  specular  component  |9] . 

In  order  to  complete  the  augmented  message  model,  one 
should  adjoin  the  message  equation  of  (!)  and  (14)  and  (15) 
in  any  order.  For  instance,  one  may  write 

*»  + l = 4>kxk  + r\w,,  + \l/kmk  (16) 

where 


*1. " 

‘ 0 

*k  = 

v; 

. = 

. mk  ~ 

mkb 

_0 1 _ 

_m*e 

1 

0 

1  

'M  o 1 

<Pk  = 

1 

<4 

0 ^ 

^ ° 
o 

1 

. = 

■'  1 0 
o : 

•0  1 

and 


0 

° 1 

(l-4>a6)  0 

0 

(1  -*»C)J 

For  the  augmented  state  model,  the  observation  model 
generated  by  the  ith  PPU  is  given  by  (see  (2),  (3),  and  (6)| 


•f  Ik1  - Hit* 

(18) 

if  lk‘  = tfka 

(19) 

and 
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V = + u*.  if  3 <:o> 

where  represents  the  one-stage  prediction  of  V|, > 

IV  LSTIMATOR  ALGORITHMS 

We  note  from  (16)  and  ( IR)-(20)  that  the  message  and  ob- 
servation equations  are  nonlinear.  We  again  resort  to  a wide- 
sense  or  Gaussian-fit  approximation  to  the  relevant  a posteriori 
density  functions  (Lv*  I Z,,M  ) and  /l.vfc  + j I /.fc'u.  Z^M). 

Let  us  make  the  following  definitions: 

**♦1*  + I]  + l|}r  Cl) 

a 1M\  _ i ,,  ,,,, 

77  = ( — ) ^*  + 1 - xh*  <-.) 

♦Ilk  \ “HI  / 


idiag  (/?„♦, 1 " Rx  + ,"k  I (23) 

Using  a first-order  Taylor  series  approximation  about  the  one- 
stage  prediction,  the  following  equations  for  updating  the  first 
two  moments  of  the  density  functions  can  easily  be  obtained. 
These  equations  are  similar  to  the  extended  Kalman  filter 
algorithms  (II). 

**♦11  »«-*")  = <t>JkUV')  + **■<»  (24) 

^ + Hk(ikM)  = 0k^(i*M)0*T  + r)lOr,r),T  (25) 


*»  + i(£fc  + ,*)  = Vniu.Uk") 


TABLE  1 

At  (.OKI  I UStS  I OK  TaRC.PT  THACKINC. 


/ton  + i"\T 

\«*+u*  / 

[(££)• 


7»  + 1U.*  + 1M)  = :?»  M ,*(/.**> 

+ A.'*  + i (A* ♦ i )!**+  iM  -^fiM)  (27) 


*k*iM  =A*+i"(^>ii»a*").*  + «>. 


Augmented  M,.,|.  i 


<tbs>  r vjc  ton  *V>Jcj  • 


Labelling  A Igor  it  Ira 
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Utiattlon  Algorithm*: 
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*k*l<Lk*l)  " *k»llk(Lk>  * Kk*l(LkM)  * l'k»l  ‘ hk»lJ 


\*V  I *♦!  k»l 
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Equations  (24M29),  when  used  in  conjunction  with  the 
labeling  algorithm  of  (8HI3),  constitute  a complete  set  of 
algorithms  for  target  tracking.  The  algorithms  are  summarized 
in  Table  I. 

V SIMULATION  RESULTS 

We  consider  the  problem  of  multiple-sensor  tracking  of  an 
aircraft  in  the  presence  of  fading.  The  message  model  is  as  in 


(7)  in  which  the  parameters  of  the  aircraft  being  tracked  are 
given  as  follows: 


x : position  at  stage  k 
x : speed  at  stage  k 
y position  at  stag'  k 
y : speed  at  stage  k 


change  in  ar -speed  between  stage  * and  stage  k + I | 
change  iny-speed  between  stage  k and  stage  * + I J(jx  i> 

"i  r o on  ro  o“ 

0 10  0 10 

r»  = 

0 0 1 T 0 0 

.0  0°  I « tax  4)  .0  l.taxg) 
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mi 


and 


0 

<722(*)J(JX2> 


= y/Ix2(k)x6(k)  cos  (* IV, ) + y/~2x1(k)x6(k)  sin  (ArH>1 ), 


h2  = \f2x3(k)Xf,(k)  cos  (kW2)  + >/2x3(k)x6(k)  sin  {kW2)% 


where  T is  the  sampling  interval 

The  additional  state  space  associated  with  the  fading 
channel  processes  is  represented  by  two  first-order  Butter- 
worth  spectra  given  by 

2Kbob2 

5*(w)  = Se(w)  = (30) 

w2+AT6a 

where  Kb  is  the  half  power  frequency  or  one-sided  bandwidth 
and  ob 3 is  the  spectral  power.  The  two  corresponding  one- 
dimensional discrete  equations  are 

**  + 1 = t~K*>Tbi , + 1 1 — e‘~KbT]ullb  + (31 ) 

and 

f*+i  = *~K*Tfk  + II  — e~**T|u»c  + cj*c.  (32) 

The  excitation  covariances  are 
9bb  = o6*(l  -e~2KbT) 
and 

4a*  = 0**0  —e~2KbT). 

We  assume  that  the  message  amplitude  modulates  a sinusoidal 
carrier  whose  frequency  is  large  compared  to  the  significant 
frequency  of  the  message,  so  that  the  signal  component  of  the 
modulated  message  has  a bandpass  spectrum  that  is  essentially 
disjoint  from  that  of  the  message  In  this  case,  wc  can  write 
I in  the  observation  model  of  1 1 ) as 

*I7*.*I  = IM2|t 

where 


Wj  and  W2  are  the  carrier  frequencies  of  ^--measurement  and 
^-measurement,  respectively,  x S(A)  = bb,  and  x6(k)  = ck.  The 
results  for  nonfading  channels  can  be  obtained  by  setting 
X5  = 1 and  x6  = 0 in  these  equations. 

The  following  set  of  parameters  were  used  in  simulations: 

T = 0.002 
**  = 30 

P(Hh')  = P(Hh2)  = 0.1  and  />(//*»)  = 0.8 
Rb  = 0.1656 
7 = 5 and  10 
Pj  = 3 1 .4 1 and  W2  = 62.82 
St(0)  = -l0dB 

A 100-stage  process  was  simulated,  and  ten  runs  were 
averaged  to  obtain  the  mean-square  errors  as  well  as  the  vari- 
ances associated  with  the  first  component  of  the  augmented 
state  vector  under  five  ground  sensors.  The  results  are  shown 
in  Figs.  1 and  2.  As  may  be  expected,  the  curves  associated 
with  fading  channels  are  bounded  from  below  by  those  for  the 
nonfading  channel.  It  may  also  be  expected  that  the  transition 
from  the  Rayleigh  model  to  the  known  signal  model  may  be 
obtained  by  varying  the  value  of  the  specular  component  7 in 
the  Rician  model,  with  7 = 0 corresponding  to  the  Rayleigh 
channel  and  7 = *°  to  the  completely  known  signal  case.  The 
corresponding  normalized  mean-square  errors  are  defined  as 

F|(Xi,  ~xh)(xh  — xlt)rl1i 

*'!*«,  **T  In 

(where  for  any  matrix  A.  [A\a  denotes  the  i/th  entry)  and  are 
plotted  in  Fig.  3. 
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VI  CONCLUSIONS 

This  concise  paper  has  presented  a set  of  algorithms  for 
tracking  a target  in  a multitarget  environment  when  the  ob- 
servations are  received  over  a fading  channel.  The  problem  of 
exploding  memory  is  overcome  by  using  a probabilistic  scheme 
in  which  a random  label  is  accepted  as  the  true  hypothesis. 
The  effects  of  the  fading  channel  are  taken  into  account  by 
augmenting  the  system  model  suitably.  Since  the  observation 
model  is  nonlinear,  the  computation  of  the  relevant  density 
functions,  both  for  obtaining  the  random  label  and  for  state 
estimation,  is  difficult.  An  approximate  scheme  which  usta  a 
Gaussian  fit  to  these  density  functions  is  used  to  obtain  explicit 
algorithms.  The  simulations  show  that  the  scheme  gives  accept- 
able performance. 
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APPENDIX  A 


STATISTICS  OF  EXTRANEOUS  RETURNS 


The  output  equation  for  extraneous  sensor  returns  (hypothesis  H^) 


was  modeled  as 


Ck  Xk|k-1  + \ 


(A.  1) 


where  v is  assumed  to  be  Gaussian,  white  with  zero  mean  and  covariance 
The  assumption  of  whiteness  of  the  miscorrelations  has  been  discussed  in 
[1]  where  it  has  been  shown  by  Monte  Carlo  simulations  that  this  is  a good 
representation  for  predicting  performance  in  dense  environments. 

In  order  to  derive  an  expression  for  $1^,  we  can  follow  the  derivation 
in  [2]  and  assume  all  returns  fall  within  a region  (gate)  G in  the  ob- 
servation space  and  let  the  size  of  the  gate  tend  to  infinity.  In  order 
for  (A.l)  to  represent  an  incorrect  return,  the  corresponding  return  must 

A 

be  the  closest  to  C^  x^j^  ^ given  that  the  closest  point  is  an  incorrect 


return.  Let 


*k  * Ck  \lk-l  Ck 


T 

Ck+Rk 


where  ^ and  R^  have  been  defined  in  Chapter  2 and  let 

°k  " mln  t(Vnjl/2 


(A.  3) 


and  let  M be  some  integer. 


We  then  define  a region  £ as 

6 ■ {*JZk  ‘ Zk  “ Ck  Xk|k-l;||zkH2^-1  ±Mok} 
Similarly  let  D be  the  region  such  that 


(A. 4) 


5 " {yklyk  * ck  *k|k-i  + V I lykl  I ^ -II  \M  *k-l}  (A-5) 

It  then  follows  that  there  should  be  no  other  returns  with  residuals  in 
the  region  G n D,  so  that  the  density  function  for  given  that  is 
an  extraneous  return  is 
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! 


f(v  |v  is  residual  of  an  incorrectly  correlated  return) 

= f(Vk  other  return  in  G (1  D) 

P[no  other  return  in  G fl  D|V  )f(v  ) 

_ _ ft Jc 

P[no  other  return  in  G fl  DJ 

If  we  assume  that  all  incorrect  returns  are  uniformly  distributed  for 
e G and  that  G fl  D can  be  replaced  by  6 with  little  error  (see  [2]), 
we  can  write 


P [no  other  return  in  G n Dlvjtl 


■ P [no  correlation  is  made  | proper  return  is  detected]  PQ 
+ P [no  correlation  is  made  | proper  return  is  not  detected]  (1-P^) 


( 2tt > n/ 2 1 | 1/2  'G 


r 1 TF  1 

exp  {"  4 Hi  ~ dyk} 

4 U.  ' D K 


«xp(-|||;k||  -Dd^) 


where  PQ  denotes  the  probability  of  detection.  We  note  that  the  denomi- 
nator of  Eq.  (A. 6)  is  just  a normalizing  constant.  The  preceding  equation 
for  p - 2 reduces  to  [2] 

f (v  | no  other  return  in  G fl  D) 

- const.  exp{-  1/2||VjJ|^  ”1)  (A. 


where 


- i + ■£  " nTF  /i  - 

Here  p is  the  correlation  between  two  measurments,  and  n^.^  is 
the  expected  number  of  incorrect  returns  in  a one-a^  region. 
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